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FOREWORD 


There  has  long  been  a  need  in  science  and   engineering    for  sys- 
tematic publication   of  research   studies   larger    in    scope   than   a 
journal  article  but  less  ambitious  than  a  finished  book.  Much  valu- 
able work  of  this  kind  is  now  published  only  in  a  semiprivate  way, 
perhaps  as  a  laboratory   report,    and    so   may  not   find   its    proper 
place  in  the  literature  of  the  field.  The  present  contribution  is  the 
fifteenth   of  the   M.I.  T.    Press   Research  Monographs,    which  we 
hope  will  make    selected  timely  and  important  research   studies 
readily   accessible    to   libraries    and   to   the    independent   worker. 

J.    A.    Stratton 


PREFACE 


The  over -all   intent  of  this   book  is  to  show  how  symbolic  logic 
and  the   theory  of  semantic  information   can  be   used  to   analyze 
certain  aspects  of  human   communication.       The    specific  aim  is 
to  construct  a  model  of  how  a  rational  receiver    evaluates   a    se- 
quence of  messages    on  the   basis   of  certain  semantic  and  prag- 
matic properties.  Perhaps  the  most  original  features  of  the  book 
are:      the  logical  analysis  of  questions  presented  in  Sections  6  to 
10,   the   analysis    of   communication  events    in  Section    12,   the 
theory  of  message-processing  in  Section  13,   and  the  analysis  of 
message  adequacy  in  Section  16.     In  the  course  of  the  book  many- 
questions   are   raised  but  not  answered,    and  many  loose  threads 
remain  dangling  at  the  end  of  the  book;   perhaps  the  book,   wher- 
ever it  is  thus    incomplete,   will  have    some   value   in  suggesting 
further  inquiry. 

The  main  line  of  exposition  throughout  should  be  understand- 
able to  the  general  reader.  If  the  reader  has  not  had  some  train- 
ing in  symbolic   logic   or   mathematics,   however,    he   might  find 
some  of  the  definitions  and  theorems  difficult  to  understand,  and 
the  proofs  of  some  of  the  theorems  hard  to  follow. 

Definitions  and  theorems  are  numbered  according  to  their  lo- 
cation; thus  Theorem  7.5  is  the  fifth  theorem  in  Section  7.  Bib- 
liographical references  are  indicated  by  author  and  year;  thus 
•Quine  [i960]'  refers  to  the  work  of  Quine  which  was  published 
in   I960  and  is  listed  in  the  bibliography  at   the  end  of  this  book. 

I  am  grateful  to  the  editors  of  Philosophy  of  Science  for  per- 
mission to  use,  in  Sections  6  to  10,  material  from  Harrah  [1961]. 

I  should  like  to  express  my  thanks  to  the  University  of  Cali- 
fornia for  two  intramural  research  grants  and  a  Faculty  Summer 
Fellowship  which  enabled  me  to  complete  the  manuscript  of  the 
book. 

With  regard  to  the  content  of  the  book,   my  greatest  debt  is  to 
Professor  Nuel  D.  Belnap,   Jr.;    he  read  a  preliminary   draft  of 
Sections   4   to    10,    called  my  attention  to   many  logical   errors, 
and  made  many  useful   criticisms    and  valuable  suggestions.        I 
am   grateful  to   Mr.  David  Kaplan  for   pointing   out  errors    in  an 
earlier   version  of  Sections    7   and   8,   Professor   R.   Sherman 
Lehman  for  valuable    criticisms   and   suggestions    in   connection 
with  Sections  1 1  to  14,    Professor  Donald  Davidson  for  stimulat- 
ing suggestions  in  connection  with  my  analysis  of  questions,   and 
Mr.    D.   P.   Agretelis   and  Professor    Louis   A.   Pedrotti   for   re- 
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search  assistance  in  connection  with  Sections  8  and  10.  To  Pro- 
fessors Oliver  A.  Johnson,  Vernon  A.  Kramer,  and  Carl  G.  Uhr, 
I  am  grateful  for  useful  criticisms  of  preliminary  drafts  of  Sec- 
tions 1  to  3  and  11.  To  Miss  Sandra  Todd  thanks  are  due  for  a 
splendid  job  in  typing  all  of  the  preliminary  drafts  of  the  book. 
Above  all,  I  wish  to  thank  my  wife  for  help  at  the  metatheoret- 
ical  and  metapractical  levels;  she  has,  inter  alia,  taught  me  much 
about  communication. 


Riverside,    California  David  Harrah 

May,    1962 
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1.    AIM    OF    THIS    BOOK 


How  does  a  rational  man  ask  questions?  What  does  he  allow- 
as  acceptable  answers?  How  does  he  evaluate  the  relative  merits 
of  different  answers?  How  does  he  measure  the  informativeness 
of  answers?  the  adequacy  of  answers?  The  aim  of  this  book  is 
to  take  steps  toward  answering  these  questions.  Our  guiding  as- 
sumption is  that  these  questions  can  be  satisfactorily  answered 
only  within  a  theory  of  communication,  a  theory  of  the  sender - 
receiver  situation.  Accordingly,  our  main  effort  in  this  book 
is  to  construct  a  model  of  how  a  rational  receiver  behaves  —  a 
model  of  how  he  asks  questions,  receives  messages,  and  evalu- 
ates the  messages  as  answers  to  his  questions. 

When  we  say  we  intend  to  construct  a  model,  we  use  'model' 
in  the  sense  now  familiar  in  the  social  sciences,  as  distinct 
from  the  sense  in  which  it  is  often  used  in  technical  studies  in 
logic.  That  is,  we  use  'model1  in  the  sense  of  'theory'  or  'pic- 
ture1, rather  than  in  the  sense  of  'realization1  or  'exemplifica- 
tion'.1 Throughout  the  book,  furthermore,  we  shall  use  the  term 
'model'  only  informally,  as  a  heuristic  device.  The  important 
thing  is  our  definitions  and  theorems;  for  heuristic  purposes  we 
may  regard  these  as  hanging  together  to  form  a  theory,  and  hence 
a  model,    of  communication. 

Our  model  is  intended  to  be  a  model  of  rational  behavior;  as 
such,  it  could  be  given  either  a  descriptive  or  a  normative  in- 
terpretation. That  is,  it  could  be  interpreted  as  asserting  that 
men  who  on  other  grounds  are  regarded  as  rational  actually  be- 
have in  a  certain  way  in  communication  situations;  and  it  could 
also  be  interpreted  as  asserting  that  anyone  who  wants  to  be 
rational  in  his  communicative  behavior  must  act  in  a  certain  way. 
For  the  sake  of  efficiency  in  our  inquiry  we  begin  with  simple 
objectives;  we  begin  with  the  assumption  that  our  only  aim  is  to 
construct  a  model  which  will  be  satisfactory  when  interpreted 
normatively.  We  make  no  claims  at  this  stage  that  our  model  is 
a  good  picture  of  actual  behavior.  Nevertheless,  we  want  the 
model  to  yield  prescriptions  which  can  be  followed  in  "real-life" 
situations;  we  can  regard  the  model  as  vindicated  only  if  the 
model  is  shown  to  be  realistic  and  practicable.  Because  of  this, 
we  shall  occasionally  speak  as  though  our  main  objective  were  to 


1  For  a  survey  and  discussion  of  the  uses  of  'model',  see  Suppes  [1960b]. 


2  communication:     a    logical    model 

construct  a  descriptive  model;  we  shall  discuss  empirical  inter- 
pretations of  the  model  and  note  the  relation  between  the  model 
and  actual  communicative  behavior. 

At  a  later  stage  we    shall   try   to    discover    whether    our    model 
is  a  good  descriptive  model;  we  shall  perform  experiments  to  de- 
termine  whether    there    are    certain   types  of  people  and    certain 
types  of  situations  in  which  those  people  behave  like  the  receiver 
in  our  model.       Before   we  can  reach   the  stage   where  empirical 
investigation  would  be  fruitful,    we   must    secure    a  considerable 
formal  development  for  our  model;  it  is  this  formal  development 
which  is  the  task  of  this  book.    The  underlying  assumption  of  our 
whole  procedure  here  is   that  it   is  philosophically  and  scientifi- 
cally legitimate  to  construct  a  normative  model,  then  interpret  it 
empirically,  and  then  experiment  to  discover  its  area  of  descrip- 
tive accuracy.2 


2  For  our   assumption   and  procedure  in  these  respects,    there  are  re- 
spectable  precedents  in   the  social    sciences.     See,    e.g.  ,    Adams  [I960], 
pp.    182-183;     Suppes  [I960  c],   p.    34;     Von  Foerster    [1952],   pp.    19-44; 
Lanchester  [1956];    and   Luce  [I960],   p.    67. 


2.    TOOLS   WE   SHALL  USE 


For  some  purposes  a  theorist  must  specify  his  theory  in  a 
fully  explicit  and  completely  formal  way;  he  must  list  in  advance 
all  of  the  assumptions  and  procedures  of  argument  which  he  in- 
tends to  use,  and  he  must  acknowledge  every  use  of  them.  For 
other  purposes,  however,  a  less  formal  exposition  is  satisfac- 
tory; rigor  is  achieved  where  it  is  important,  but  complete  ex- 
plicitness,  either  in  assumptions  or  arguments,  is  not  attempted. 
In  this  book  we  shall  try  to  be  as  precise  as  possible  in  formu- 
lating our  definitions  and  theorems;  but  we  shall  not  list  all  of 
our  assumptions  explicitly,  and  we  shall  not  supply  all  of  the 
logically  necessary  steps  in  the  proofs  of  our  theorems.  Con- 
cerning the  unlisted  assumptions  we  can  only,  for  the  present, 
hope  that  they  are  not  controversial. 

To  formulate  and  explore  the  properties  of  our  model,  we 
shall  need  a  number  of  tools.  One  tool  we  shall  avail  ourselves 
of  is  a  small  fragment  of  one  version  of  the  English  language. 
We  shall  refer  to  expressions  by  means  of  single  quotation  marks; 
the  word  'hill5,  for  example,  will  be  named  by  '  'hill'  '.  Double 
quotation  marks  will  be  used  for  naming  inside  another  name,  as 
in  the  sentence  'Russell  wrote  "On  Denoting"  ';  they  will  also  be 
used  to  indicate  unusual  or  metaphorical  usage,  as  when  we  say 
that  a  poet  manipulates  the  "deeper"  meanings  of  language.  We 
shall  use  underlining  to  indicate  that  an  expression  is  being  de- 
fined; occasionally  underlining  will  be  used  to  express  emphasis, 
and  also  to  indicate  that  we  are  talking  about  a  concept,  as  in 
'the  concept  of  rationality'.  Sometimes  we  shall  refer  to  linguis- 
tic expressions  by  setting  them  apart  on  lines  of  their  own;  we 
might  name  the  sentence  'This  sentence  is  false'  by  writing  it  thus 

This  sentence  is  false. 

Occasionally  also,   where   there   is  no   danger  of  ambiguity,   we 
may  even  let  an  expression  serve  as   a  name  of  itself;    thus  we 
might  let  the  letter  'x',  without  quotes,  stand  as  a  name  of  itself. 
We  shall  also  need  some  of  the  tools  provided  by  modern  logic,. 
We    shall   interpret    '-p'    as    'Not   p',     '(p    &  q)'    as    'p   and   q' , 
'(P   v   q)1    as    *P   or    9.'    ^n  ^e  inclusive  sense  of  'or',    '(p   -*■   q)'  as 


1  Concerning  formality  and  permissible  deviations  from  it,   see  Carnap 
[1939];    Woodger  [1939],    Chap.    VI;     and  Rosser  [1953],    Chap.   V. 
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•If  p   then  q',     '(p  « — »•  q)1    as    'p   if  and  only  if  q' ,    '(a  =  b)'    as    'a 
is  identical  with    b1,    '(x)F'    as    'Every    x    is    such  that    F1,    and 
'(Ex)F'    as    'Some    x    is  such  that    F'.     We  shall  feel  free  to  draw 
on  the  whole  of  the  usual  truth-functional  logic  of  sentences,  and 
in  particular  to  assert  any  tautology.     We  shall  also  draw  freely 
on  the  whole  of  the  usual  logic  of  first-order  predicates  and  iden- 
tity,  and  in  particular  take  advantage   of  the   usual  theorems   of 
that  logic.2 

We  shall  also  utilize  several  concepts  and  assumptions  of  set 
theory,  restricting  ourselves,  however,  to  set  theory  of  the 
Zermelo-Fraenkel  type.3  In  particular  we  shall  make  frequent 
use  of  the  notions  of  set  and  membership  in  a  set.  To  refer  to 
a  set,  we  shall  use  either  a  description  of  the  form  'the  set  of 
entities  such  that  .  .  .'  or  else  a  list  of  the  members  of  the  set. 
Occasionally  we  shall  use  'x  e  y1  to  mean  'x  is  a  member  of  y1. 
Concerning  the  existence  of  sets  we  shall  assume  the  following: 

(1)  If  we  may  speak  of  the  entities    Xj,    .  .  .  ,   x    ,   then  we 
may  speak  of  the  set  whose  only  members  are- x,  ,    .  .  .  ,   x 
If    y    is  a  set  and    F  is    a   one-place   predicate   which  is~ 
"definite"   for  all  the  members  of    y  —  that   is,   for    each 
x    in    y  ,    either    Fx    or     -Fx    is  true  —  then  we  may  speak 
of  the  set  which  contains  just  those  members    x    of    y  for 
which    Fx    is  true.4    We  may  speak  of  sets  which  contain 
no  members;    these  will  be  called  null  or  empty. 

(2)  If  we  have  asserted  (x)[(x  e  y)  < — *•  (x  e  z)]  ,  then  we  may 
assert  (y  =  z),  and  conversely.  If  we  have  asserted 
(x)[(x  e  y)  -*■  (x  e  z)]  ,  then  we  may  assert  that  y  is  in- 
cluded in  z  ,  and  conversely.  If  we  have  asserted  that 
y  is  included  in  z  and  y  is  not  identical  with  z  ,  then 
we  may  assert  that  y  is  properly  included  in  z  ,  and 
conversely.  If  x  is  included  in  y,  we  call  x  a  subset 
of    y  . 

(3)  If  we  may  speak  of  the  sets    xx 2£n  »   then  we   may 

speak  of   (a)   the  union  of    x,,    .  .  .  ,   x    ,T:hat  is,    the  set 
of  those  things    y    which  are  in  either-^  ,    or    .  .  .    or   xn; 

(b)  the  intersection  of    x   ,    .  .  .  ,   xn  ,   that  is,   the    set  of" 
those  things    y    which  are  in  both    x.    and    .  .  .    and    xn  ; 

(c)  the  difference  of  x2  and  *2  '  that  is'  the  set  of  th°se 
things  y  which  are  in  x  but  not  x2  >  (&)  t^ie  power  set 
of    x,  ,   that  is,   the  set  of  all  the  subsets  of    xx . 

(4)  If  we  may  speak  of  the  entities    x,  >    •  •  •  >   2£n  '    ^en  we 
may  speak  of  the  ordered  set  or  ordered  n-Tuple  of  the 
entities    x.,    .  .  .  ,   x      arranged  in  that  order;     this  or- 
dered n-tuple  will  be"  denoted  by   '<  x  ,   .  .  .  ,    x.n>'  .    Oc- 


2  See  Church  [1956],    Chaps.   I  and  III. 

3  See  Fraenkel  [1958]  and   Suppes  [1960  a]. 


Fraenkel  [1958],   p.  38. 
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casionally  we  shall  call  <x  .  .  .  .  ,  x  >  a  sequence  and 
refer  to  it  by    ,xJ ,  .  .  .  ,  Xn1.  If  "we   have   asserted 

[<x1»  •  •  •  ,  2£n>  =  <v-i'  •"■  ■  i  Y^]  >  then  we  may  as- 
sert [(x1  =  Yj)"^  .  .  .  &  (2Sn  =  Yn)]  »~ and  conversely.  If 
x  is  a  set  of  ordered  n-tuplesT  then  we  may  regard  x 
as  an   n-ary  relation. 

In  regard  to  relations  we  shall  use  standard  terminology,    as 
follows: 

F    is  reflexive  if  and  only  if    (x)  (Fxx)  . 
F_    is  symmetric  if  and  only  if    (x)  (y)  (Fxy  ->  Fyx)  . 
F    is  transitive  if  and  only  if    (x)  (y)  (z)  [(Fxy  &  Fyz)  -*■  Fxz]  . 
F    is  irreflexive  if  and  only  if    (x)  (-Fxx). 

F    is  nonreflexive  if  and  only  if  F   is  neither  reflexive  nor 
irreflexive.     And  so  on. 

We  shall  also  use  several  concepts,  assumptions,  and  techniques 
of  mathematics.    One  important  technique  that  we  shall  use  is  fi- 
nite mathematical  induction.    That  is,  we   may  assert  that   every 
positive  integer  satisfies    F,  provided  that  we  have  already  shown 
F(l)     and    [F(n)  ->  F(n  +  1)]     for  any  positive  integer    n.       An   as- 
sumption which  will  be  important  in  connection  with  message  pay- 
ment is  the  real-number  axiom:   If  a  set  of  real  numbers  is  non- 
empty and  has  an  upper   bound,    then  it  has  a   least  upper   bound. 

In   connection  with  logical  formulas  involving   parentheses   we 
shall  occasionally  use  the  following  devices  of  abbreviation: 

(1)  Outermost  parentheses  maybe  dropped. 

(2)  '  ■* — >•    and  '  -*■'    dominate    '  &'    and    'v',    which   dominate 

1  -  f,    which  dominates    •  (x)  '    and    '  (Ex)  • ,    which  dominate 
1  Fx',    'e',    and     '  =  '. 

(3)  Parentheses  may  be  dropped  from  nested  conjunctions  and 
from  nested  disjunctions;    parentheses  are  to  be    restored 
in  accordance  with  a  convention  of  association  to  the  left. 

(4)  Parentheses  may  be  replaced  by  square  brackets. 
Thus  the  formula 

(  (  (  (x)  (x  e  y)  v  -  (x  =  y)  )  &  (  (p  v  q)  v  r)  )  ->  r) 

may  be  written 

[(x)  (x  e  y)  v  -  x  =  y]  &  (p  v  q  v  r_)  -*  r_ . 

The  formula    (p  v  q  v  r),    for  example,  will  be  regarded  as  a  dis- 
junction with  three  disjuncts;   conjunction  will  be  treated  analogously. 

In  addition  to  using  the  logical  and  mathematical  apparatus  men- 
tioned above,  we  shall  also  use  assumptions  of  an  empirical  nature. 
Many  of  these  concern  the  psychology  of  human  beings  in  commu- 
nication, game,  and  decision  situations.  We  shall  assume,  for  ex- 
ample, that  in  certain  communication  situations  the  receiver  wants 
to  obtain  as  much  information  as  he  can,  for  as  little  cost  as  possible. 


3.    MESSAGE   EFFECT 


The  situation  we   wish  to   analyze   is   the    situation  in  which  a 
message    m    affects  a  receiver    R.    With  an  empirical  interpre- 
tation in  mind,  we  could  speak  of  the    "impact"   which    m    has  on 
R  ;     with  a  normative  interpretation  we   may   speak  of  how     R 
evaluates,    or  ought  to  evaluate,   m. 

We  wish  in  particular   to   analyze  the   impact,   or  value,    of    m 
in  terms  of  how  it  answers  some  questions    which    R    asks;      let 
us  say  that  these  questions  compose  a  set    Q.   When    R    receives 
m  ,   he  does  not  evaluate    m    by  itself;    he  evaluates    m    in  light 
of  all  of  his  background  knowledge.       But  the   message  and    R's 
knowledge  must  be  "commensurate"  with  each  other  in  the  sense 
that  logical  relations  can  hold  between  them.  The  natural  course 
for  our  analysis,    then,    is   to   let    R    have   a  language    L    with  a 
logic  capable   of  handling   declarative    sentences,   and  to  let   the 
message    m    and    R's    knowledge    k    be  declarative  sentences  of  L. 

The   questions  in    Q    can  also  be  taken  as  sentences  of  a  cer- 
tain sort,  formulable  within    L  .   In  Sections  6  to   9   we    shall   de- 
vise a  way  of  analyzing  questions  so  that  the  questions  in    Q,    and 
their  answers,    can  have    satisfactory  logical   relations   with     m 
and    k  .       It  is  perhaps  worth  pointing  out  here  that  the  notion  of 
Q    is   a   very  general   and  important  notion  for   any  theory  of 
communication.       One  reason  is   that    Q    represents    R's    inter- 
ests,  in  the  following  way:      R    is   interested  in  various    subject 
matters;    a  subject  matter  can  be  construed   as  a  set  of  terms 
and  predicates  of    L    (or,    more   generally,    as  a  set   of  expres- 
sions of    L);      and    R    can  be  regarded  as  articulating  his  inter- 
est in  the  subject  matter  by  asking  questions  which  are  formu- 
lated in  terms  of  the  subject  matter.        A   second   reason  why  Q 
is  important  is  that    Q    can  provide    R    with  test   questions,   and 
thus  with  a  means  for  cross-examining  the  sender.     The   nature 
of  the  sender    S    will  be  clarified  later. 

When    R    receives    m,    he  tries  to  use  it  to  answer  his  ques- 
tions.      He    considers   his   questions    one  by  one,   and  for    each 
question  he   tries   to   deduce   an  answer;      he  tries  to  deduce  the 
answer  from    m  ,   with  the  aid  of    L    and    k  .  In  this   analysis 

m  will  be  informative  —  that  is,  bring  "news"  —  to  the  extent 
that  it  helps  to  answer  questions  which  are  not  answerable  with- 
out    m ,    and    R    can  evaluate    m    accordingly. 

Moreover,  R  can  make  rough  evaluations  of  messages  solely 
on  the  basis  of  L,  k,  and  Q.  For  example,  he  can  define  the 
adequacy  of    m    as  the  ratio  of  the   number   of  questions   in    Q 
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which  are  answerable  by  the  conjunction  of    m    and    k  ,   but  not  by 
k    alone,    to   the  total   number  of  questions  in    Q.     Also,     R     can 
assess  the  value  of  a  set  of  messages  in  terms  of  how  completely 
the    messages    answer    his    questions,    how   few  messages    are 
required  to  answer  all  of  his  questions,   how  early  in  the  message 
sequence  the  crucial  ones  occur,    and  so  on.    To  allow   R   to  make 
finer  evaluations,    however,    we  must  add  some  evaluation  func- 
tions to  our  model. 

It  seems  futile  to  search  for  a  single  function  giving  "the  value" 
of  a  message.     Consider,    for  example,    the  message  'You   won't 
have  to  pay  any  hospital  bills  because   your   wife   just   murdered 
your  cancer -stricken  son'.     Surely  this  message    can  have  many- 
kinds  of  value;     R     can  evaluate  it  on  many  dimensions.         All  we 
shall  attempt  here  is  to  explore  the  properties  of  one  particular 
evaluation  function,    an  information  function    I  .    Having    I   avail- 
able allows    R    to  evaluate  the  information-  or  news-value  of    m, 
and  more  generally  of  a  sequence  of  messages.       Concerning    I, 
we    shall   make   as   few  assumptions   as   possible,   but  we    shall 
nevertheless  have  to  limit  our  consideration  to  just   one  class. of 
several  possible  classes  of  information  functions.   In  the    course 
of  our  discussion,   however,    we  shall  comment  on  the  possibility 
of  adding  to  our  model  other  information  functions  and  evaluation 
functions  of  other  types. 

There  are  many  ways  in  which  R  can  use  his  L  ,  k  ,  Q  ,  and 
I  ,  but  in  this  book  we  shall  consider  only  one.  Our  main  effort 
here  is  to  work  toward  a  scoring  system  based  on  L,  k,  etc.  , 
whereby  R  can  assign  various  scores  to  messages  received 
over  a  period  of  time.  By  means  of  this  system  R  can  repre- 
sent S's  behavior  graphically:  He  can  construct  profiles  which 
plot,  for  example,  amount  of  information  against  time,  or  mes- 
sage adequacy  against  message  number.  If  this  scoring  system 
is  a  good  one,  it  should  be  possible  to  correlate  these  profiles  of 
S's    message  behavior  with  other  aspects  of    S's  behavior.  It 

should  be  possible,   for  example,   to  find  some  types    of  profiles 
which  are  characteristic  of  frauds  and  ignoramuses,    and   others 
which  are  characteristic  of  honest  men.        Thus     R    can   use   his 
scoring  system  to  make  a    "second-order"    evaluation  of  the  mes- 
sage sequence.     If  he  pays  cash  for  messages,   he  can  delay  pay- 
ment for  a  certain  length   of  time,    during   which   he    can   cross-, 
examine    S  ,   verify  the  messages,    and  analyze  the  profiles  of  S's 
behavior;    he  can  continue  to  delay  payment  as  long   as   the   pro- 
files have  any   "warning    signs"   about  them,   or   when  he   finally 
does  pay,    he  can  scale  down  the  payment  in  accord  with  the  warn- 
ing signs. 

Our  model  allows    R    great  flexibility  in  the  use  of  his    I_    func- 
tion.    Perhaps    R    will  pay  cash  to    S    only  on   the  basis  of   still 
other  functions,  using    I    merely  for  record-keeping  and  analytical 
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purposes;    or    R    may  use    I    both  for   record-keeping   and  for 
making  payments.     Throughout  most  of  this   book  it  will  not  be 
necessary  to  distinguish  between  various  payment  systems,    and 
we  shall  occasionally,    for   heuristic   purposes,    speak  of    R    as 
paying  cash  for  messages  on  the  basis  of    I. 

>!<      *      -f     >!«     -!< 

Given    L ,   k ,    Q ,    and    I ,    it   is   easy  to    construct   a  model   of 
communication  situations  of  a  relatively  simple  type.  We  can  de- 
fine   R    as  the  ordered  quadruple    <  L. ,   k,    Q,    I>,    and   commu- 
nication events    E    as  ordered  pairs    <R,   m>,   where    m     is   a 
declarative  sentence  of    L.     A  communication  sequence   is  a  se- 
quence of  communication  events,   different  types  of  sequence  be- 
ing defined  by  different  relationships  among  the    m's   and  among 
the    R's  .     A  sequence  of  communication  events  with  one  common 
R    but  different    m's    could  be   taken  as   the    standard  model   of 
static  communication  situations,    such  as  the  situation  of  a  per- 
son reading  a  book.     A  slightly  more  general  model  is  that  of  the 
interview  situation,    in  which    R's    knowledge  and  interests   may 
grow  during  the  course  of  the  communicative  interaction;       this 
situation  could  be  modeled  by  a  communication  sequence  with  the 
following  properties:    All  the    L's    are  identical,    all  the   I's   are 
identical,    each  later    Q    includes  all  earlier    Q's,    and  each  later 
k    implies  all  earlier    k's. 

The  most  interesting    communication  situations,  however,    are 
not  book-reading  or  interview  situations.       More   interesting  are 
situations    of  education,    conversion,  or  persuasion  —  including 
political  campaigns,  courtroom  battles,  controversy  between  sci- 
entific or  philosophical  schools,  and  the  like.  In  these,    R  might 
change  his  mind  between   Ej    and  E2  in  such  a  way  that   (kj  &:  k2) 
is  inconsistent.  His  change  of  mind  may  be  due  to   S's   messages, 
or  it  may  represent  a  self-induced  insight;   in  either  case  we  want  a 
general  "economics  of  persuasion"  which  can  allow  for  such  cases. 

In  order  to  obtain  the  desired  generality  we  shall,  both  explic- 
itly and  implicitly,  introduce  references  to  belief.    Instead  of  try- 
ing to  define  R  as  an  abstract  quadruple,  we  shall  assume   R   as  an 
unanalyzed  parameter;   thus   R    could  be  a  human  being,    a  business 
corporation,  or  a  nation.  We  shall  not  assume  that  k   is  true,   but 
merely  that    k    is  consistent  and  that    R    believes  that    k    is  true. 
The  assumption  of  consistency  is  useful  for  the  proof  of  some   of 
our  theorems,   and  in  any  case  it  is  a  reasonable  requirement   to 
impose  on  a  rational  R  .  When  we  characterize   R's    information 
function    I,    we  shall  make  the  reference  to    R's  beliefs    explicit. 
On  the  whole,  the  accommodation  of    R's    belief s  in  our  model  will 
make  parts  of  the  model  somewhat  inelegant,  but  on  the  other  hand 
it  should  render  the  model  somewhat  easier  to  understand  and  interpret. 

■fc        jk        jfe        jfe        jfe 
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The  subtlety,    power,    and  generality  of  our    analysis    of  mes- 
sage effect  will  depend  on  the  assumptions   we   make    regarding 
L»   k,    Q»    I.    and    m.     For    m    we  choose  to  stipulate    only   the 
minimal  restriction  that    m     is    a   declarative    sentence   of     L; 
thereby  our  model  can  apply  to  cases  where     R    does    not  know 
in  advance  what  kind  of  communication  he  may  be  receiving  — 
information,    entertainment,    or  "poudre  aux  yeux".     For    L,    on 
the  other  hand,   we  shall  make  a  number  of  detailed  assumptions 
whose  effect  is  to  give    L    a  relatively  powerful  logical  system; 
these  assumptions  thus  cut  down  the    generality  of  our   model, 
but  they  make  possible  a  great  gain  in  analytical  power,    partic- 
ularly in  the  analysis    of  questions    and  answers,    and  we    shall 
try  to  show  that  they  are    reasonable    on   other    grounds    as    well. 
For  all  the  components  of  the  model    our    aim   will  be    to    formu- 
late our  assumptions    in   enough   detail   to    secure   the   analytical 
power  we  need  for  the  purposes  of  this  book,    without  committing 
ourselves  to  details  which  cut  down  the  generality  of  our   model 
for  no  good  reason.     Our  policy  in  this  respect  will  appear  most 
clearly  in  the  case  of    S  ;     since  we  are  concerned   only  with  an 
analysis  of  message  effect,  we  shall  identify  S   with  the  sequence 
of  messages. 

Now  it  might  be  thought  that    since   we   want   to    study  the   be- 
havior of  a  rational    R,   we  should  specify  a   rational     L  ,    a   ra- 
tional   k,    and  so  on.     Unfortunately  the  meaning  of  'rational'    is 
in  dispute,    and  we  wish  to   avoid   controversies    on  this   matter, 
so  we  shall  not  presume  to  be  dealing  with  an    R    who  is  rational 
in  any  absolute  or  universal  sense.     As  the  first  step  in  narrow- 
ing our  focus  and  our  claims,    we  decide  to  consider  only  the  ra- 
tionality of   "scientific  man"    or    economic   man   or    logical   man. 
But  it  can  be  argued  that  further  narrowing  is  needed.    It    can  be 
argued  that,    although  we  know  in  a   rough  way  what    science   is 
and  can  point  to  the  tradition  of  scientific  activity,    we  neverthe- 
less cannot  define  'science'    or  'scientific'  in  a  way  which  is  suf- 
ficiently precise  for  any  purposes  other  than  those  of  historians 
of  science.1     Here  we  shall  assume   that    'scientific    rationality' 
needs  further  explication  and  that  the  explication   may  never    be 
complete,    that  we  have  some  rough  notions  about  what  scientific 
rationality  is  or  ought  to  be,    that  we  may  use  these  rough  notions 
to  guide  our  analysis  of  rational  communication,    and  that  we  may 
at  least  for  heuristic  purposes  regard  our  effort  to  analyze  com- 
munication as  a  contribution  toward   explicating    'scientific   ra- 
tionality' . 

The  rationality  of    ~L_ ,    Q,    and    I    will  be  discussed  separately 
later.     We  shall  have  occasion  to  ask  questions    such  as    'How 
does  a  rational  man  ask  questions?',    to   make    assumptions    like 


1  For  arguments  supporting  this  position,  see  Gallie  [1957,    1959]  and 
Harrah   [1959]. 
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•The  safest  course  for  a  rational  receiver  in  this  situation  is 
.  .  .  *,  and  to  use  such  terms  as  'clear',  'precise',  and  'fruitful'. 
In  all  such  usages  our  general  orientation  is  to  the  tradition  of 
Western  science,  broadly  conceived.  When  we  make  our  final 
specifications  for  L. ,  Q,  and  I,  however,  we  shall  make  some 
specifications  which  exclude  alternatives  even  within  the  West- 
ern scientific  tradition.  Thus  we  can  claim  only  that  our  model 
represents  one  type  of  scientific  rationality. 

Whether  it  is  a  good  model  of  scientific   man,    and  whether    it 
is  also  a  good  model  of,    say,   legal  or  logical  or  economic  man, 
remain  to  be  seen.     In  any  case,   we  do   not  here    claim  that  the 
results  of  the  main  analysis  of  this   book   hold   for    other    models 
of  scientific  man,    or  for  models  of  other  types.     Perhaps  no  one 
model  can  be  an  adequate  model   of   scientific   man   in  the    sense 
that  it  applies  adequately  in  all  situations  which  we   want   to    call 
scientific.  Perhaps  no  model  of  scientific  man  can  be  an  adequate 
normative  model  for,    or  empirical  model  of,   nonscientific  man. 
However  this  may  be,    in  the  course  of  our  analysis  of    R's    be- 
havior   it  will  become   apparent  not  only  that     R     behaves   like 
scientific  man  in  some  respects,   but  that  he  can  also  enjoy  some 
of  the  amenities  and  exaltations  supposedly  reserved  for  nonsci- 
entific man,    such  as  being   able   to   ask  unanswerable   questions 
and  appreciate    "depth-meanings". 


4.    THE   RECEIVER'S    LANGUAGE 


Of  all  the  components  of  our  model,  the  fundamental  one  is  L  . 
Both  m,  k,  and  the  questions  in  Q  are  sentences  of  L  ,  and  I 
is  a  function  which  is  applied  to  sentences  of    L_ .  Accordingly 

we  begin  construction  of  our  model  by  specifying  our  assumptions 
about    L  . 

To  talk  about    L    we  need  language  of  some  sort;  we  shall  call 
the  language  in  which  we   talk  about    L    the  metalanguage   of    L  , 
or    ML    for  short.       We  talk   about    ML    by  using    MML,    and  so 
on.  Following  the  standard  procedure  in  regard  to  metalanguages,1 
we  assume  that    L,    ML,    MML  ,   ....    are  distinct;     we  assume 

that    ML  ,    MML are    sufficiently   rich  so   that   we  can  use 

ML,  MML  ,  .  .  .  ,  to  say  about  L,  ML,  .  .  .  ,  everything  we 
want  to  say.  Following  customary  practice  again,  we  shall  not 
state  explicitly  all  of  the  content  of  ML,  MML  ,  ....  We  as- 
sume that  we  have  ML,  MML  ,  .  .  .  ,  available  as  part  of  the 
framework  within  which  this  analysis  as  a  whole  is  being  devel- 
oped. Part  of  the  content  of  our  metalanguages  has  already  been 
specified  in  Section  2  ;  additional  specifications  will  be  made 
later.  When  we  use  our  metalanguages  to  talk  about  L  ,  we  shall 
strive  not  for  complete  precision  of  statement  and  complete  rigor 
of  proof,  but  only  for  precision  and  rigor  sufficient  for  the  pur- 
poses of  this  book. 

Now  it  is  not  only  we  who  develop  and  use  this  model  who  need 
a  metalanguage  for  talking  about  L;  according  to  our  model,  R 
also  needs  to  be  able  to  talk  about  L  .  If  R  is  aware  of  what 
the  model  says  he  is  doing,  or  if  he  wants  to  use  the  model  as  a 
norm  for  what  he  ought  to  do,  then  he  must  be  able  to  refer  to 
L  .  He  must  be  able  to  say,  for  example,  'The  expression  F  is 
a  sentence  of  L'  ,  or  'F  is  not  an  answer  to  a  question1.  To 
simplify  our  analysis  we  shall  assume  that  R ' s  metalanguage, 
metametalanguage,  and  so  on,  are  identical  with  our  own  ML, 
MML,   .  .  .  ,    respectively. 

Our  intention,  which  will  be  expressed  by  the  explicit  assump- 
tions below,  is  that  L  has  the  following  structure:  It  has  an 
inner  core,  which  we  shall  call  the  "standard  part"  of  L;  this 
contains  names  of  individuals,  predicates  of  individuals,  and  the 
usual  logic  of  such  predicates,  quantifiers,  and  identity.  The 
standard  part  of    L    also    contains    other    linguistic,    logical,    and 


1  Following    Tarski  [1956],    especially   Chap.  VIII,  and  Church   [1956], 
pp.    47-68. 
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mathematical  resources,  including  perhaps  a  theory  of  sets  suf- 
ficient to  serve  as  a  foundation  for  mathematics,  and  some  non- 
logical  postulates  concerning  the  names  and  predicates.  The 

standard  part  of    L,   furthermore,    is  consistent.  Finally,  outside 
the  standard  part,    L    might  also  contain   still  further   linguistic 
and  logical  apparatus,  such  as  large  parts  of  the  English,  Chinese, 
and  Russian  languages.     We  allow    L    to  have  a  nonstandard  part, 
and  to  have  "nonlogical"  content  in  its  standard  part,  for  the  sake 
of  generality  in  our  model  of  communication.        We  may  think  of 
R    as  using   the   nonstandard  part   of    L ,    and  also    ML ,    MML , 
and  so  on,    for  various  types  of  thinking  and  communicating,    in- 
cluding his  informal,    over -all  reasoning.  He    resorts   to   the 
standard  inner  core  of    L    when  he  wants  to  think  and  communi- 
cate about  certain  matters  in  a  rigorous  way  —  in  particular, 
when  he  wants  to  ask  questions  of  certain  types  and  analyze   the 
logical  content  of  certain  messages.     Though    L    may   contain  a 
variety  of  apparatus,    all  we  shall  work  with  in  this  book,  all  we 
shall  specify  in  detail  and  take  advantage  of,    is  the  logical   core 
explicitly  specified  for  the  standard  part. 

For  the  obvious  connotation,   we  label   the   important   assump- 
tions about    L    'LA  1'  ,    'LA  2'  ,    etc.  Every  paragraph  begin- 
ning with  such  a  label  is  an  assumption;       unlabeled  paragraphs 
following  an  assumption  give  informal  comment  on  the   assump- 
tion.    If  a  word  or  phrase  occurring  in  an  assumption,    or  in  the 
comments  given  in  connection  with  an  assumption  or  theorem,  is 
underlined,   then  the  sentence  containing  the  phrase  may  be  con- 
strued as  a  definition  of  the  phrase,    or  of  the  concept  expressed 
by  the  phrase.     It  will  be  obvious  that  there  is  considerable   re- 
dundancy in  our  assumptions  about    L  ;    this  redundancy  has  been 
introduced  in  order  to  simplify  our  exposition.   It  will  be  obvious 
likewise  that  in  formulating  our  assumptions   about     L     we   have 
recourse  to  metalinguistic  levels  higher  than  that  of    ML  . 

Occasionally  we  shall  mention  effective  specification.    When  we 
say   'All  the    y' s     such  that  .  .  .  are  effectively  specified',  we  shall 
mean  that  the  user  of   L  has  effectively  in  his  possession  a  method 
by  which  in  a  finite  time   he    can  mechanically   construct   any     y 
such  that  .  .  .   and  a  method  by  which  in  a  finite  time  he  can  me- 
chanically recognize  whether  or  not   a   given   entity  is    a    y    such 
that  ....    When  we  say   'There  are  effectively  specified   y's    such 
that  .  .  .  '  ,   we  shall  mean  that  there  exists  a  nonempty  set  whose 
only  members  are    y's     such  that  .  .  .  ,    and  that  the   user    of    L 
has  effectively  in  his  possession  a  method  by  which  in  a  finite 
time  he  can  mechanically  construct  any  member  of  that  set  and  a 
method  by  which  in  a  finite  time  he    can  mechanically  recognize 
whether  or  not  a  given  entity  is  a  member  of  that  set.     Thus,   to 
say  that  there  are  effectively  specified  y's  such  that  ...   is   not 
to  say  that  all  of  the   y's    such  that  .  .  .  are  effectively  specified. 
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To  refer  to  the  structure  of  expressions  of    L,   we    shall  use 
the  following  auxiliary  notion:     The  concatenation  of    y   ,   .  .  .  ,  y 
is  the  result  of  writing    y      followed  by   .  .  .    followed  by    y_.  Oc~ 
casionally  we  shall  refer  to  the  concatenation  of    F.  ,  .  .  .  ,~F     by 
means  of  the  concatenation  of    'F  '  ,  .  .  .  ,  'Fn'  .  Thus  we  may- re- 
fer  to   the    concatenation  of    F    and     G     by    'FG'  .        We   assume 
that  concatenation  is  an  operation  which   can  be   performed  me- 
chanically,  that  concatenations  are  unique,   and  that   the    concat- 
enation of    y, y      can  be  mechanically  recognized  as  being 

the  concatenation  of    y~  ,  .  .  .  ,  y    . 

By   'expression  of    L'  we  shall~mean  the  concatenation  of  a  fi- 
nite sequence  of  signs  in  the  alphabet  of    L .     To  refer  to  expres- 
sions of    L  ,   or  sets  of  expressions  of    L. ,   we    shall  need  vari- 
ables in  our  various  metalanguages.     Our  stock  of  variables  for 
this  purpose  will  be  all  of  the   letters   of  the   English  alphabet, 
both  lower-  and  upper-case,  with  or  without  numerical  subscripts 
and  prime -mark  superscripts,    and  usually  underlined.   We    shall 
sometimes  use  these  variables  by  themselves,  to  refer  to  expres- 
sions;   and  we  shall  sometimes  use  them  as  operators,   to   refer 
to  expressions  in  which  other  expressions  might  occur.     For  ex- 
ample,   'Ft'   may  be  used  to   refer    to   expressions   in  which  the 
term  referred  to  by    't'    might  occur. 

In  all  of  our  assumptions  about    L  ,    'interpreted  as   referring 
to'    will  mean   'interpreted  as  referring  to  and  only  to';       simi- 
larly for    'interpreted  as  denoting'    and    'interpreted  as    desig- 
nating or  expressing'.     'Interpreted'    will  mean    'interpreted  ac- 
cording to  an  effectively  specified  rule   of  interpretation  of    L'  . 
'True'    will  mean    'true  in    L' ;     similarly  for    'false1,  'expresses', 
'designates'  ,     'denotes'  ,    'meaningful'  ,    and   'meaningless'  . 

LA    1:   The  standard  alphabet  of    L    is  the  set  consisting  just 
of  the  standard  signs  of    L,    namely: 

(  ,     )  ,    *-.     'i    =  i     -i    &,    j_,    £ ,    i 

The  alphabetic  order  of  the  standard  signs  of  L   is  the 
order  in  which  these  signs  have  been  listed  above.       A 
standard   expression   of    L    is   the  concatenation  of  a 
finite  sequence  of  standard  signs  of    L  .        The   lexical 
length  of  a  standard  expression    F    of    L    is    n     if  and 
only   if    F    is    a    sequence     s_x  ,   .  .  .  ,  s        of   standard 
signs  of    L.       A   standard   expression-  F    of    L    is   im- 
possible   if  and   only   if  the    lexical  length    n    of    F     is 
such  that    0  <  n  ^  r_  <  oo  .       The    alphabetical   order    of 
the  standard   expressions    of    L    is    defined   as  follows: 
(1)   If    F    and    G     are    standard   expressions    and  the 
lexical  length  of    F    is  less  than  the  lexical  length 
of    G,    then    F    precedes    G    in  the   alphabetical 
order. 
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(2)   If    F    and    G    are  standard  expressions  having  the 
same   lexical   length  and  identical    signs   in  their 
first    m-1  places,   and  the    m        sign   of     F    pre- 
cedes the    m        sign  of    G    in  the  alphabetic  order 
of  the  standard  signs  of    L  ,    then    F    precedes     G 
in  the  alphabetical  order. 

From  LA    1  we  can  obtain  the  following  theorems: 

Theorem  4.1:   Every  nonempty  set  of  standard   expressions 
of    L    has  exactly  one  alphabetically  first  member. 

Theorem  4.2:   For  every    r  ,   the  number  of  r-possible  stand- 
ard expressions  of    L    is  finite. 

By  'r-possible'  we  intend  to  connote  'length  of  no  more  than 
r  units'  and  also  'possible  for  R'.  To  delimit  the  class  of  ex- 
pressions which  R  must  be  prepared  to  confront  in  practice,  he 
can  use  the  notion  of  r -possibility.  With  this  device  available 
we  can  allow  the  vocabulary  and  grammar  of  L  to  be  infinite  in 
certain  ways.  As  we  shall  see,  R  can  form  any  number  of  vari- 
ables and  constants,  and  predicates  of  any  degree  of  complexity; 
thus  R  has  infinitely  rich  linguistic  resources  at  his  disposal, 
and  there  is  no  theoretical  limit  to  the  complexity  of  the  ques- 
tions he  may  ask  or  the  answers  he  may  receive. 

LA   2;    There  exists  a  nonempty  domain    D    of  individuals  which 
is  sufficiently  well  specified. 

This  domain  is  the    "universe  of  discourse"   of    L.     By  assum- 
ing that  it  is  sufficiently  well  specified,   we  mean  that  the  user  of 
L    can  specify  clearly  enough  what  is  and  what  is  not   in  the   do- 
main.2   For  the  sake  of  generality  we  make  no   assumption  about 
the  number  of  things  in  the  domain,    except  that  there   is   at   least 
one  thing  in  it. 

LA    3 :    The  sign    '  o  '    is  an  individual  constant  of    L  ;    and  if    c 
is  an  individual  constant  of    L,    so  is     c_x.    Each  individ- 
ual constant  of   L   is  interpreted  as  denoting  exactly  one 
member  of  the  domain    D  . 

To  refer  to  these  individual  constants  we  shall  use  the  variables 

*i  .    bj  ,    Cj  ,    dx  ,    a2  ,    b2  ,     c2  ,    d2  ,    a3  , 

which  will  occasionally  be  abbreviated  by 

a  ,    b  ,     c_ ,    d  ,    a "  ,    b '  ,     c_"  ,    d'  ,    a"  ,     .  .  .  , 

respectively.     In  the  above  assumption  it  is  neither  assumed  nor 
denied  that  every  member  of  the  domain  is  denoted  by  some    in- 


2  Note  that  complete  explicitness  here  is  not  needed;     see   Quine  [I960], 
p.    232. 
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dividual  constant  of  L,  and  it  is  neither  assumed  nor  denied  that 
some  individual  in  the  domain  is  denoted  by  two  or  more  individ- 
ual constants  of  L,  but  it  is  assumed  that  no  individual  constant 
denotes  two  or  more  individuals  in  the  domain. 

LA   4:    The  sign    'i'    is  an  individual  variable  of    L;      and  if    v 
is  an  individual  variable  of    L  ,    so  is  vx  . 

To  refer  to  these  individual  variables,  we  shall  use  the  variables 

u,    v,    w,    x,    u,    v,    w,    x,    u,     ..., 

— 1  '      — 1  '      — l'      — 1  '      —2'     —2'      —2'      —2'      —3'  ' 

which  will  occasionally  be  abbreviated  by 

u,    v,    w,    x,    u,    v\    w" ,    x',    \x",     .... 

respectively.  We  make  no  assumptions,  one  way  or  the  other, 
about  whether  L  has  other  types  of  variable.  Hereafter,  when 
we  use  'variable'  without  qualification,  we  shall  mean  the  indi- 
vidual variables  specified  by  LA  4.  These  variables  are  regarded 
as  ranging  over  the  domain  D  ,  in  the  sense  that  if  a  variable 
occurs  as  part  of  a  standard  expression,  that  expression  is  in- 
terpreted as  referring  to  the  domain  in  a  certain  way,  as  indicated 
below. 

Both  the  individual  constants  and  variables  of    L    will  be  called 
terms    of    L  .     To  refer  to  these  terms  we  shall  use  the  variables 

Lj »    I2  '    -3  '     •  •  •  ' 

which  will  occasionally  be  abbreviated  by 

t,    t",    t",     .  .  .  , 
respectively. 

LA    5:    For  any  finite  and  nonempty  sequence     *  .   .   .  *    of    *'  s  , 
'j*  .  .  .  *'     is    a   predicate    constant   of    L  .  If    f    is    a 

predicate    constant  of    L ,    so  is     f ,  .      If    f_    contains    ex- 
actly   n    *'s  ,    then    f    is    interpreted   as    designating    or 
expressing    an  n-ary   relation   among    members    of  the 
domain    D     taken    n    at   a   time    in   a    certain   order,       f 
will   be    said  to   be    n-ary   or   n-place    or    of   degree     n , 
and    f    is  definite  in   the  sense  that  for  every  n-tuple    y 
of  members    of  the    domain    D     the    relation   designated 
or  expressed  by    f    either  holds  or    does  not  hold  among 
the  members  of    y    taken  in  order. 

We  shall  hereafter  usually  use    'predicate'    in  place  of   'predi- 
cate constant'    and   'express'    in  place  of   'designate  or    express'  . 
Occasionally  we  shall  use    'property'    to  refer  to  relations  of  de- 
gree one.        To   refer   to   these   predicates   we    shall  use   the 
variables 
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tx>     g^     fej.    12.     g2>     h2,     f3,     .... 
which  will  occasionally  be  abbreviated  by 

f ,     g,    h,     f",     g',    h',    f",     .   .  .  , 

respectively.  We  neither  assume  nor  deny  that  every  property 
of,  or  relation  among,  members  of  the  domain  is  expressed  by 
some  predicate  of  L  ;  and  we  neither  assume  nor  deny  that  two 
or  more  predicates  of    L    express  the  same  property  or  relation. 

Occasionally  we  shall  speak  of  a  predicate   as   definite   for   a 
set  of  terms,    meaning  thereby  that  the  predicate   is    definite   for 
the  entities  referred  to  by  the  terms.     The  assumption  of  defi- 
niteness  might  seem  dubious  on  the  ground   that   there   might  be 
terms  and  predicates  of    L    which   "just  don't  fit  together",    like 
'Caesar1    and   'trapezoidal'  ;    some  philosophers  have  argued  that 
utterances  like    'Caesar  is  trapezoidal'    are  neither  true  nor  false 
but  are  meaningless.     We   make   the   assumption  of  definiteness 
mainly  because  it  simplifies  the  formulation  of  our    assumptions 
about  the  grammar  and  logic  of    L    and  because   it   is   extremely 
useful  in  our  analysis  of  questions.     It  guarantees   the    existence 
of  questions  of  a  certain  type  and  simplifies  the  procedures  used 
in  the  question -asking  process.     Further,   the  assumption  of  def- 
initeness accords  with  our  actual  question -asking  behavior,    es- 
pecially in  situations  where  the  questioner  is  ignorant  or  is  pio- 
neering in  a  new  field.     It  is  also  compatible  with  the  basic  atti- 
tudes behind  both  sides  in  the    "Caesar-trapezoid"    controversy. 
Here    L    is  allowed  to  have  a  small  vocabulary  so  selected  that 
Caesar  can  never  be  called  trapezoidal,    and    L    is  also  allowed 
to  have  a  vocabulary  so  rich  that  Caesar  can  be  called  trapezoidal 
or  subjunctive  or  postmeridional,    etc.  ,   where  all  such  predica- 
tions are  considered  to  be  meaningful  but  turn  out  to  be   false. 
To  know  that  they  are  false,    R    may  have  to  inspect   the  world  — 
Caesar  in  particular  — or  he  maybe  able  simply  to    consult  the 
rules  of  interpretation  of    L  .3 

The  next  seven  assumptions  specify  the  grammar  of  L.  In 
effect  they  define  the  wffs,  that  is,  the  well-formed  formulas  of 
the  standard  part  of    L  . 

LA   6 :   If    f    is  an  n-place   predicate   as    specified  by   LA    5, 

and    t   ,     ....    t      are  terms  as  specified  by   LA  3  and 
4,   where    n  >  0  ,—  then    f(t    .  .  .  tn)    is  a  wff ;    it  will  be 
called  the  [n-place]   predication  "of    f    and    t    ,  .  .  .  ,  tn. 
An  n-place  predication    f(t    .  .  .  tn)    is   interpreted  as~ 
being  true  just  in  case    t    ,     ...  ,— tn    are   individual 
constants  denoting    y   ,     .  .  .  ,    yn  ,  "respectively,    and 
the    relation  expressed  by   f  holds  among  y   ,     .  .  .  ,    y 


Concerning  the  technique  of  "ruling  false",    see  Quine  [I960],  p.  229. 
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taken  in  that  order,    and  false  just  in  case   t    ,     .  .  .  ,    tn 

are  individual  constants  denoting    y    ,     .   .   .  ,     y     ,    re-  * 

spectively,    and  the  relation  expressed  by    f    do"es    not 

hold  among    y   ,     .  .   .  ,     y       taken  in  that  order. 
°    —  i  -i-n 

LA   7:   Some  predications  as  specified  by  LA  6  are  true. 

This  assumption  does  not  specify  whether  the  rules    of    inter- 
pretation of    L    suffice  to  establish  the  given  predications  as  true; 
the  rules  of  interpretation  specify  that  a  given  predication  is  true 
in  case  certain  conditions  hold,    and  those  conditions  might  hold 
as  a  matter  of  fact,    independently  of  the  rules  of    L  . 

LA   8:   If    t    and    t2    are  terms  as  specified  by  LA  3  and  4,  then 
(t    =  tz)  is  a   wff;    it  will  be  called  the  identity  of  t    and 
t2 .     An  identity    (tx  =  t  )    is   interpreted   as   being   true 
just  in  case    t.    and    t       are  individual  constants  denot- 
ing   y     and    y2,    respectively,    and    y      is  the  same  thing 
as    y   ,    and  false  just  in  case    t      and    t      are    individual 
constants  denoting    y      and    y    ,    respectively,  and  y     is 
not  the  same  thing  as    y   .    Further,   for   every  pair    of 
things    y      and    y      in  the  domain    D  ,    either    y       is    the 
same  thing  as    y     or  it  is  not. 

LA   9:   If    F    is  a  wff,    so  is     -F  ;    it  will  be  called  the  negation 
of    F.        It  is  interpreted  as  being  true  just  in  case    F 
is  false,    and  false  just  in  case    F    is  true. 


We 


shall  abbreviate    »-(t    =  t2)'    by   '(t^t^)', 


LA  10:  If    F    and    G    are   wffs,    so  is    (F  &  G);  it  will  be  called 
the  conjunction  of    F    and    G  ,    and    F    and    G     will  be 
called  its  conjuncts.  It  is   interpreted    as     being 

true  just  in  case    F    and    G    are  both  true,    and  false 
just  in  case    F    or    G    or  both    F    and    G     are   false. 

Occasionally  we  shall  use    '(F    &.  .  .  &Fn)'    as  an  abbreviation 
for    '((-..(  (Fj  &  F2)  &  F3)  &.  .  .   &  Fn_x)  &~Fn)';    we    shall   call 
(F     &  ...    &  Fn)    an  n-place  conjunction,    and- F  ,     .  .  .  ,    F      its 
conjuncts. 

LA  11:  If    F    is  a  wff  and    x    is  a  variable  as  specified  by   LA 
4,   then   (x)F    is  a  wff.     We  shall  call    (Xj)  .  .  .  (xjF 
the  universal  generalization,    or  universal  quantifica- 
tion,    of    F    with  respect  to    x    ,   .  .  .  ,  xn  ,    and    F    its 
quantified  function.    We  shall  designate  ~" (x)     as  a   uni- 
versal  quantifier,    and    x    as  its  quantified  variable.   A 
universal  generalization    G    is  interpreted  as   being 
either  true  or  false  just  in  case  every  variable    v    oc- 
curring in    G    occurs  in  some   wff    (v)H    which  is   part 
of    G  .       A  universal   generalization    G    is    interpreted 
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as  being  true,  or  false,  according  as    G    is,   or   is   not, 
satisfied  by  every  infinite  sequence   of  objects    in  the 
domain    D. 

LA  12;   Nothing  is  an  individual  constant,    individual   variable, 
predicate  constant,   predication,    identity,   negation, 
conjunction,    universal  generalization,   or   wff  of    L  un- 
less its  being  so  follows  from    LA  1  to  11.       The  set  of 
all  wffs  will  be  called  the  standard  part  of    L  . 

Theorem  4.3:  Every  wff  is  effectively  specified. 
Proof:    Every  wff  is  either  a  predication,  an  identity,  a  ne- 
gation,  a  conjunction,  or  a  universal  generalization.      Each 
of  these  is  constructed  by  concatenation  of  wffs  or  standard 
signs  of    L,    according  to  the  rules  provided  by   LA  6  to  12. 

To  characterize  the  sentences  and  theorems  of    L,    we    shall 
use  the  following  auxiliary  notions.     An  occurrence  of  a  variable 
x    in  a   wff    F    is  a  bound  occurrence  of    x    in    F    just  in   case    it 
stands  in  a  wff    (x)G    that  is  part  of    F  .     An  occurrence  of  a  var- 
iable   x    in  a  wff    F    is  a  free  occurrence  of    x    in    F    just  in  case 
it  is  not  a  bound  occurrence  of    x    in    F  .    A  variable    x    is   bound 
or  free  in  a  wff    F    according  as  there  are  bound  or  free  occur- 
rences of    x    in    F.     An  occurrence  of  a  term    t    in  a  wff    F  is  a 
bound  occurrence  of    t    in    F    just  in  case    t    is  a  variable    x     and 
the  occurrence  of    x    is  a  bound  occurrence  of    x    in    F  .     An  oc- 
currence of  a  term    t    in  a  wff    F    is  a  free  occurrence  of    t     in 
F    just  in  case  it  is  not  a  bound  occurrence  of    t    in    F,     A  term 
t    is  bound  or  free  in  a  wff    F    according  as  there  are  bound  or 
free  occurrences  of    t    in    F  . 

From  the  foregoing  assumptions  about    L    it  follows  that  some 
wffs  have  no  free  variables;    we  shall  refer  to  such  wffs  as    de- 
clarative sentences.     Occasionally  we  shall  use    'statement'    as 
an  abbreviation  of   'declarative  sentence'  . 

LA  13:  Every  wff  is  meaningful,  and  not  meaningless.  Every 
statement  is  either  true  or  false.  If  F  is  interpreted 
as  true,    then  it  is  true. 

This  assumption  is  clearly  reasonable.7    The  first  part  relates 
our  analysis  of    L    to  the  ordinary  notion  of  meaningfulness.       It 
states  that  the  standard  part  of    L    is  meaningful,   but  it  does  not 
state  that  any  other  expressions  of    L    are  meaningless. 


4  We  assume  that  we  have  available  a  satisfactory  analysis  of  "satis- 
faction";    see  Tarski  [1956],  Chap.  VIII,  Sec.  3. 

5  Cf.    Church  [1956],   p.   170. 

6  Adapted  from  Montague  [1961],   pp.   167-169. 

7  Concerning  the  background  and  justification  of  the  truth-or -falsity 
assumption,    see  Tarski  [1956],  Chap.  VIII,  Sec.  3. 
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LA  14:    We  make  the  following  assumptions: 

(1)  The   wff  -  (F  &  -G),  which  we  shall  refer  to  by   »(F  -*>  G)', 
is  interpreted  as  asserting  that  if    F    then    G.  We  shall 
call    (F  ->  G)     a  conditional,     F    its  antecedent,    and    G 
its  consequent. 

(2)  The    wff  [(F  ->  G)  &  (G  ->  F)]  ,  which  we    shall    refer    to 
by   ((F  * — >  G)1  ,    is  interpreted  as  asserting  that    F     if 
and  only  if    G  .     We  shall  call    (F  ■* — *■  G)     a  biconditional, 
or  a  material  equivalence,    and    F    and    G    its    equiva- 
lents. 

(3)  The   wff    -(-F&   -G)  ,    which  we    shall   refer    to   by 
'(F  v  G)'  ,    is  interpreted  as  asserting  that    F    or   G   in 
the  inclusive  sense  of   'or'.     We  shall  call    (F  v  G)    an 
inclusive  disjunction,    or   sometimes  simply  a  disjunc- 
tion,   and    F    and    G    its  disjuncts. 

(4)  The   wff  [(F  v  G)  &T-(F   &  G)]  ,    which  we    shall   refer 
to  by   !(F  ><  G)'  ,    is  interpreted  as  asserting    that     F 
or    G  ,  in  the  exclusive  sense  of   'or'  .       We    shall    call 
(F  ><  G)     an  exclusive  disjunction,  or  sometimes  simply 
a  disjunction,    and    F    and    G    its  disjuncts. 

(5)  The    wff    -  (x  )   -  .   .  .  -  (xn)  -F  ,  which  we  shall  refer  to 
by   f(Ex  )  .  .  .  (Ex   )FJ  ,   we    shall   call  the   existential 
generalization,    oF  existential  quantification,  of   F   with 
respect  to    x   ,   .  .  .  ,  xn,    and    F    its    quantified  func- 
tion.    We   shall  call    (Ex)     an  existential  quantifier,  and 
x    its  quantified  variable.    An  existential  generaliza- 
tion   G    is  interpreted  as   being    either    true    or    false 
just  in  case  no  variable  occurs  free  in    G;     and    G    is 
interpreted  as  being  true,    or  false,    according    as     G 
is,    or  is  not,    satisfied  by  at  least  one  infinite  sequence 
of  objects  in  the  domain    D. 

As  with  conjunction,  we  shall  occasionally  use  '(F    v  .   .   .  v  Fn)' 

as  an  abbreviation  for    '((...(  (F   v  F  )  v  F„)  v  .  .   .  v  F      )  v  F^)' : 

\  \  v  v_j       — 2/        _3/  — n-l'       — n'   ' 

we  shall  call     (F   v  .   .   .  v  Fn)     an  n-place  disjunction,    and    F  , 

.   .   .   ,  F_     its  disjuncts. 
'  — n  + 


5.    THE   RECEIVER'S   LOGIC 


The   logical   apparatus  of    L    is   important   in  our  analysis  for 
several   reasons.       First,    the   deduction  rules  of    L    constitute 
the   means   by  which    R    extracts   the   usable    content   from     m  . 
Second,    the   logical   truths    of    L ,    and  more   generally  all  the 
statements  whose  truth  is  established  by  the  rules  of  L  ,  consti- 
tute knowledge  which    R    can  provide   himself;    knowledge  which 
R    can   obtain  simply  by   operating  with,    or  on,    the    rules  of    L; 
knowledge  which  he   does   not  have   to  obtain  from  anyone    else; 
knowledge  for  which  he  does  not  have  to  pay  S  .    Third,    L  might 
have  so  rich  a  logical  system  that    R    cannot  mechanize  the  proc- 
ess  of  deducing   theorems   or   the   process   of   showing  that  non- 
theorems  are  not  theorems.1    The  importance   of  this   is   that   it 
adds  a  further    "degree  of  freedom','   to  the  behavior  of    S    and    R 
in  the  game  they  are  playing.   R    might  not  have  the  ability  or  the 
patience  to  explore  all  of  the  logical  consequences  of  a  message, 
or  he   might  lure    S    into  a  trap  by  leading    S    to  believe  that     R 
is  less  of  a  logician  than    R    in  fact  is,   and  so  on. 

LA  15:    The  axioms  of    L    are: 

(1)  all  wffs   which  are  tautologies; 

(2)  all  conditionals  [(x)F  -*■  G],  where  G  is  like  F  ex- 
cept for  containing  free  occurrences  of  some  term  t 
wherever    F    contains  free  occurrences  of    x; 

(3)  all   conditionals    [(x)(F  ->  G)  -*  [F  -*  (x)G]],    where     x 
is  not  free  in    F  ; 

(4)  all  identities    (t  =  t)  ,   where    t    is  a  term; 

(5)  all   conditionals    [(t    =  t  )  ->  (F  < — *■  G)]  ,   where    t      and 
t      are  terms  and    G    is    like    F    except  for  containing 
free  occurrences  of    t.    where    F    contains   free   oc- 
currences of    t    ; 

(6)  all  wffs  in  the  set  M,  where  M  consists  of  n  effec- 
tively specified  wffs,    where    n  >  0  . 

Here  (1)  is  a  basis  for  L's  logic  of  sentences,  (2)  and  (3)  for 
L's  predicate  logic,  and  (4)  and  (5)  for  L's  logic  of  identity.  The 
set  M  is  intended  to  consist  of  nonlogical  axioms,  axioms  for 
mathematics,    "meaning  postulates",    and  so  on.2 


1  See  Kleene  [1958]    and   Quine  [1959],    Sec.  41. 

2  On   (1)  to  (5)  see  Church  [1956],   Sees.  15,  30,  48;  also  Copi  [1954], 
Chap.  IX.    On  meaning  postulates,  see  Carnap  [1952b]. 
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LA    16:    The  set  of  deduction  rules  of    L    is  effectively  speci- 
fied.    One  deduction  rule  of    L    (which  we  shall  refer 
to  by   'MP',    for    'modus  ponens')    is:     If    F     and     G 
are  wffs,    then    F    is  derivable  from    G    and    (G  -*■  F). 

LA  17:    F    is  derivable  in    L    from    G  ,     .  .  .  ,    Gn    if  and  only 
if  there  is  some  finite    sequence   of  wffs  "whose   last 
member  is    F  ,    such  that  each  member  of  the  sequence 
either  is  one  of    G  ,     .  .  .  ,    Gn    or  is  an  axiom  of  L   or 
is  derivable  from  preceding   members  via  the  deduc- 
tion rules  of    L  .     F    is  provable  in    L    if  and  only  if  F 
is  derivable  in    L    from  an  axiom  of    L  .    F    is    L-true 
if  and  only  if    F    is  a  statement  and    F    is    provable    in 
L  .     Further  : 

(1)  If  F  is  provable  in  L  and  x  is  a  variable,  then  (x)F 
is  provable  in  L  .  [We  shall  occasionally  refer  to  this 
fact  by   'Gen1,   for    'generalization'.] 

(2)  If    (F  >« — *■  G)    is  provable  in    L  ,   and    H    and    J   are  wffs 
where    J    is  like    H    except  for  containing  some  occur- 
rences of    G    where    H    contains  occurrences  of  F,  then 
(H  < — *■  J)    is  provable  in    L  . 

(3)  If    F    is  derivable  in    L    from    G,,    .  .   .  ,    G    ,     then 

(G„  ->  F)    is  derivable  in    L    from    G — ,    G       .3 

v— n       _'  —  — 1»  — n-i 


Hereafter  we  shall  usually  omit   'of    L'    and   'in    L'    following 
'axiom',    'derivable',   and  'provable'.    We  shall  say  that  the    clo- 
sure  of  a  wff   F    is    (xn)  .  .  .  (x.)F.   where    x  »     .  .  .  ,    xn     are   all 
of  the  free  variables  of    F,   arranged  in  alphabetical   order.     [It 
follows  that  the  closure  of  a  wff  is  a  statement,  and  that  a    state- 
ment is  its  own  closure.]      We  shall  say  that    F    L-implies    G    if 
and  only  if    (F  ->  G)    is  provable,   that    F    and    G    are    L-equiva- 
lent  if  and  only  if    (F  < — *■  G)    is  provable,   that    F    is    L-false   if 
and  only  if    -F    is  provable,    and  that    F    is     L-consistent   if  and 
only  if    F    is  not    L-false.    Occasionally  we  shall  use  'consistent' 
for     'L-consistent'    and    'inconsistent'    for    'L-false'.         We  shall 
say    'F    and    G    are  inconsistent',  or    'F  and  G   are  incompatible', 
or    'F    is  inconsistent  with  G'     for    '(F  &  G)     is     L-false'. 

LA  18:    The  standard  part  of    L    is  consistent,  in  the  following 
sense:    Not  every  wff  is  provable;    in  particular,    if     F 
is  provable,    then    -F    is  not  provable.    If   F    is     L-true, 
then  it  is  true;    if  it  is  true,    then  it  is  not  false.     If    F 
is  false  or    L-false,    then  it  is  not  true;  if  it  is  not  true, 
then  it  is  not    L-true. 


3  See  Church  [1956],   Sec.  36. 

4  See    Quine  [1951],   Sec.  14. 
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The  justification  for  LA  18  will  be  discussed  later.  First  we 
shall  list  some  theorems,  many  of  which  will  be  important  later 
in  our  analysis  of  questions  and  message -processing. 

Theorem  5.1 :    Not  all  wffs  are  true;    not  all  wffs  are  false; 
not  all  wffs  are    L-true;    and  not  all  wffs  are    L-false. 
Proof:    By  LA  7,    some  wff    F   is  true.     By  LA  9,     -F     is 
false;   by  LA  18,   -F    is  not  true  and  not    L-true,  and  hence 
F    is  not    L-false.     By  LA  18,   F    is  not  false. 

Theorem  5.2:   It  might  be  the  case  that  some  wffs    F     are 
such  that  neither    F    nor     -F    is  provable. 

Proof:   It  might  be  the  case  that  neither    (a  =  b)    nor  (a  #  b) 
is  provable. 

For  the  following  theorems  we  assume  that   F,   G,  and  H   are  wffs. 

Theorem  5.3:    F    is  provable  if  and  only  if  the  closure  of   F 

is    L-true. 

Proof:   If    F    is  provable,   then  by  Gen  so  is    (x)F  .  If  the 

closure  of    F    is    L-true,   then,   by  LA  15    (2)  and  MP,    F  is 

provable. 

Theorem  5.4:   If    F    L-implies    G  ,   then: 

(1)  if    F    is  true,    so  is  the  closure  of    G; 

(2)  if    F    is  provable,    so  is    G; 

(3)  if    F    is    L-true,    so  is  the  closure  of    G; 

(4)  if    G    is    L-false,    so  is    F  . 

Proof  of  (1):   If    (F  -*•  G)    is   provable,   then  by  Gen   so   is 
(x)(F  -*■  G)  .     Since    F    is  true, it  has  no  free  variables;  hence, 
by   LA  15    (3)  and  MP,    [F  -*  (x)G]    is  provable.         By  this 
method  we  can  obtain    (F  ->  G")     as   a  provable    statement, 
where    G"    is  the  closure  of    G.     Then    (F  -*■  G")    is    L-true 
and  hence  by  LA  18  is  true.     But    F    is  true;    hence,  by  the 
interpretation  of  *->',   G'    must  be  true  also. 
Proof  of  (2):    This  holds  because  of  MP. 

Proof  of  (3):  As  in  the  proof  of  (1),     (F  -*■  G')    is    L-true  and 
hence  is  derivable  from  an  axiom.     But  so  is    F  .    Hence,  by 
MP,   G'    too  is  derivable  from  an  axiom  and  thus  is    L-true. 
Proof  of  (4):      This   holds   because   of  the   tautology 
[-G -*  [(F  ->  G)  -  -F]]  ,   and  MP. 

Using  the  above  theorems,    and  methods  like  the  ones  used  in 
proving  those  theorems,   we  can  obtain  the  following: 

Theorem  5.5:   If   F    is   provable,    then    F    is    L-implied  by 
any    G  . 

Theorem  5.6:   If    F    is    L-false,  then    F    L-implies  any     G. 

Theorem  5.7:   If    F    and    G    are  both  provable,   or  both    L- 
true,    so  is    (F  &  G)  . 
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Theorem  5.8:    If    F    is    L-false,    so  is     (F  &  G)  . 

Theorem  5.9:    If    F    L-implies    G    and    G    L-implies     H, 
then    F    L-implies    H. 

Theorem  5.10:    If    F    L-implies   G  ,    so  does     (F  &  H)  . 

Theorem  5.11:    If    F    L-implies    G,  then   F    L-implies    (G  v  H). 

Theorem  5.12:   If    F    L-implies    G    and    F    L-implies    H,  then 
F    L-implies    (G  &~H)~7 

Theorem  5.13:    If    F    is     L-implied  by    G    and  also  by    H,  then 
F    is    L-implied  by"  (G  vTl)  . 

Theorem  5.14:   F    L-implies    F. 

(F  &  -F)    is    L-false. 

The  closure  of    (F  v  -F)    is    L-true. 

If    F    is    provable,  then    (F  &  G)     is    L-equiva- 


Theorem  5.15 


Theorem  5.16 


Theorem  5.17 


lent    to    G  . 

Theorem  5.18:   If    F    is    L-false,   then    (F  v  G)    is    L-equiva- 
lent  to    G_. 

Theorem  5.19:   If    F    L-implies    G  ,    then    (F  &  G)    is    L- 
equivalent  to    F  . 

Theorem  5.20 ;  If   F    and    G    are    L-equivalent,  then    G    or    -G 
is  provable  according  as     F    or    -F  ,  respectively,  is  provable. 

Theorem  5.21 :  It  is  not  the  case  in  general  that,  if  F  is  L- 
consistent  and  G  is  L-consistent,  then  (F  &c  G)  is  L-con- 
sistent. 

Is  there  any  justification  for  our  assumption  that  the  standard 
part  of    L    is  consistent?    To  justify  this  assumption,   we    should 
have  to  specify  precisely  the  additional   content  of  the    standard 
part  mentioned  in   LA  15    (6)   and  then  discover    a   consistency 
proof.     Now  the  standard  part  of    L    resembles   many  languages 
of  a  certain  type  in  widespread  use  today;    that  is,   with   suitable 
specification  in  connection  with   LA  15   it   could  be   made   to   re- 
semble them.     It  seems  very  unlikely  that  all  languages    of  this 
type  are  inconsistent,    and  if  any  one  of  them  is  shown  to  be  con- 
sistent by  satisfactory  methods    of  demonstration,  then   our    as- 
sumption in  LA  18  will  be  justified. 

If  it  should  be   discovered   that    L    is  inconsistent,    R     might 
choose  a  language  built  around  a   theory  of  types   rather   than  a 
theory  of  sets;5    if  he  did,   his  language  would  differ    from    L     in 
fundamental  ways,    not  only  in  its  axioms  but  also  in  its  grammar. 
The  analysis  developed  in  this  book  could  probably  be  carried  over 


See  Fraenkel  [1958],    Chap.  III. 
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to  such  a  language,   but    R    and    S    would  have  to  sacrifice  many 
questions  and  answers  which  are  allowed  them  if  they  use    L.   Al- 
ternatively,   R    might  choose  a  language  whose  deductive  appara- 
tus is   one    of  the  varieties  of   "intuitionist"   logic.       Such  a  lan- 
guage would  differ  from    L    not  so  much  in  its  grammar  as  in  its 
L-truths.     We  could  probably  develop  an  analysis  of  questions  for 
such  a  language,   along  the  same  general  lines  as  the  analysis  de- 
veloped below,   but  it  would  differ  in  many  of  its   logical   details 
and  would  probably  be  much  more  complex. 

How  adequate  a  language  is    L?  Some  philosophers  have  claimed 
that  linguistic  and  logical  apparatus  of  the  sort  specified  by   LA 
1  to  18  is  an  adequate  organon  for  expression  and  argument  — 
adequate  not  only  for  mathematics  and   science,   but  for    all   ra- 
tional discourse  of  any  kind.7    Here  we   do  not   claim  that    L    is 
adequate  for  all  rational  discourse  or  even  for  all  scientific  dis- 
course in  particular .     We   might   eventually  want  to    supplement 
LA    1  to  18,   to  provide    L    with  a  better  logic  of  descriptions,   a 
logic  of  modalities,   or  the  like.8    But  we  do  claim  that  the  stan- 
dard part  of    L  ,   as  specified  by   LA  1  to  18,   provides   a   good 
basic  language,    a  language  which  will  be  adequate  to    R's    needs 
in  a  wide  range  of  communication  situations. 


6  See  Fraenkel  [1958],   Chap.  IV. 

7  Quine  [i960],  Sees.  33,    47,    49. 

8  See,    e.g.,    Montague  [I960,    1961], 
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How  does  a  rational  man  ask  questions?  We  must  already  know 
part  of  the   answer,    if  we   even  presume   to   ask  this   question; 
what  we  want  is  the  rest  of  the  answer  —  a  rigorous  and  fruitful 
logic  of  questions.  A  complete  theory  of  questions  would  include: 

(1)  a  phenomenology  of  the  questioning  mood, 

(2)  a  grammar  of  the  interrogative   "mood"   in  ordinary 
language, 

(3)  a  calculus   of  "the   doubtful"   as    one   among    several 
truth-values, 

(4)  an  organon  for  the  one-person  problem -solving  situation, 

(5)  an  organon  for   the   two-person  question-and-answer 
situation. 

In  this  book  our  primary  concern  is  with  (5).  Our  objective  is  to 
construct  a  model  of  a  rational  procedure  for  asking  questions 
and  giving  answers,  applicable  in  situations  where  the  receiver 
wants  to  acquire  information,  explore  meanings,  and  evaluate 
the  quality  of  the  sender  as  an  information  source.  We  assume 
that  no  satisfactory  theory  of  the  question-and-answer  process 
has  hitherto  been  developed  and  that  we  are  free  to  propose  our 
own.  By  way  of  formulating  conditions  of  adequacy  for  our  analy- 
sis here,  we  stipulate  that  any  theory  of  a  rational  question-and- 
answer  process  should  have  the  properties  stated  in  the  numbered 
paragraphs  below.  [Note  that  the  unnumbered  paragraphs  con- 
tain informal  explanations  and  comments.] 

(1)  The  theory  should  explicate  'question1  in  such  a  way  that 
the  user  of  L  can  effectively  construct,  for  each  one  of 
his  interests,  a  question  which  (a)  corresponds  to  this 
interest  in  a  reasonable  way,  (b)  is  not  meaningless,  and 
(c)    does  not  commit  the  fallacy  of  many  questions. 

The  general  point  of  (b)  and  (c)  here  is  that  in  a  two-person 
situation  the  questioner  is  responsible  for  his  questions.  He  will 
be  penalized  if  he  asks  questions  which  are  defective,  and  hence 
he  would  like  to  be  able  to  avoid  defective  questions.  Further- 
more, if  the  questioner  avoids  defective  questions,  both  parties 
will  benefit  in  another  way:  Defective  questions  have  to  be  chal- 
lenged and  corrected,  and  thus  the  question-and-answer  inter- 
change, if  it  does  not  involve  defective  questions,  is  smoother  and 
more  efficient. 
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(2)    The  theory  should  explicate    'question'    in  such  a  way  that 
every   question  is    effectively  recognizable   as   being   a 
question. 

The  point  of  this  requirement  is  that  the  questioner  wants  the 
respondent   to  be   able   to   tell  when  he   is   being   questioned. 

(3)  The  theory  should  explicate  'question'  in  such  a  way  that 
for  every  question  there  is  a  reply  which  answers  the 
question  in  some  reasonable  sense. 

We  make  no  requirements  on  the  number    of  answers   which  a 
question  may  or  must  have.     It   can  be   argued  that   in  ordinary 
language  there  are  rational  questions  which  have    (a)    exactly  one 
correct  answer,    (b)    exactly  one  incorrect  answer,    (c)   more  than 
one  correct  answer,    (d)   more  than  one  incorrect  answer,    (e)   no 
true  answer,    (f)   no  false  answer.     An  example   of  the   first  two 
kinds  is  'Does  one  plus  one  equal  two?'   where    'yes'    and  'no*  are 
the  two  answers.     Most  questions  have  a  plurality  of  correct  and 
incorrect  answers.     'Who  wrote  Waverley?'    can  be  answered  by 
'Scott',    'The  author  of  Ivanhoe',     'George   IV1,   and   'Bertrand 
Russell'.     An  example  of   (e)    is  'What  are  all  the  integers?'  con- 
strued as  requesting  a  complete  list.     A  true  answer  would  give 
an  infinite  list,   but  no  list  can  be  infinite.     An  example  of  (f)   is 
'What  is  something  which  can  be  talked  about?'  or  'What  is  some- 
thing which  belongs  to  a  class  of  things?'  .     Since  each  thing  can 
be  talked  about  and  each  thing  belongs  to  some  class,    '.  .  .    can- 
not be  talked  about'    and   '  .  .  .  belongs  to  no  class'    are   false   no 
matter  what  name  is  substituted  for    '...'. 

(4)  The  theory  should  distinguish  between  replies  which  are 
complete  and  sufficient  as  answers,  and  replies  which 
are  incomplete  or  insufficient  as  answers.  Furthermore, 
both  the  questioner  and  the  respondent  should  be  able  ef- 
fectively to  determine,  for  any  given  question,  what  re- 
plies count  as  complete -and-just- sufficient  answers  to 
that  question. 

The  point  of  the  complete -incomplete  distinction  is  that  it  con- 
stitutes the  first  step  toward   a   quantitative   analysis   of  the   re- 
spondent's behavior.      This  will  prove  to   be    very   important   for 
the  present  inquiry.     The  point  of  criteria  for  just- sufficient  an- 
swers is  that  the  respondent  must  know  how  to  put  together  some- 
thing which  the  questioner  will  accept  as  an  answer.  This  require- 
ment is  indispensable.     A  question-and-answer  procedure   which 
cannot  meet  it  is  irrational,    for  if  the  respondent  makes  a   reply 
the  questioner  must  ask  'Was  that  an  answer?'  and  if  the  respond- 
ent replies  to  the  latter  question  the  questioner  must  again  ask 
'Was  that  an  answer?'  ,    and  so  on  to  infinity.     It  would  be  absurd 
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to  demand  criteria  for    true  or  correct  answers.     We  do   not   re- 
quire that  the  content  of  the  answer  be  good,   but  merely  that   the 
answer  have  a  form  which  was  previously  agreed  on  as  acceptable. 
Furthermore,    we  make  this  requirement  only  for  just-sufficient 
answers.     It   would  be  demanding    too   much  to    ask   for    criteria 
whereby  one    could   effectively  recognize    expressions   as   being 
equivalent  to  just- sufficient  answers  or  as  implying  just- sufficient 
answers.     It  would  likewise  be  excessive  to  demand   criteria   for 
recognizing  expressions  as  being  partial  answers,    that   is,     ex- 
pressions which  are  implied  by  just-sufficient  answers.        These 
demands  would  be  absurd  for  the  reason  that,    as  logicians   have 
shown,    such  criteria  simply  do  not  exist.     A  corollary  to  this  is 
that  it  would  also  be  too  much  to  demand   a    criterion   for    distin- 
guishing between  answers  of  any  degree    of   completeness    on   the 
one  hand  and  replies  of  nonanswer  types  on  the  other. 

(5)  The  theory  should  explicate  various  relations  of  relevance 
among  questions.     In   particular    it    should   yield   a    con- 
tainment relation  which  is  transitive,    and  an  equivalence 
relation  which  is  reflexive,    symmetric,    and  transitive. 

The  point  of  having  an  explication  of  these  notions  is  that  they 
can  provide  the  questioner  with  a  means  for  planning  question 
campaigns.  If  he  has  a  large  number  of  questions  to  ask,  he 
might  be  able  to  reduce  some  to  others  by  way  of  equivalence  or 
containment  relationships. 

(6)  The  theory  should  explicate    'answer'    in  such  a  way  that 
answers  can  be     L -implied  by  statements. 

The  effect  of  this  last  condition   of   adequacy   is    that   answers 
must  be    wffs  ;    they  cannot  be  mere  individual  constants  or  pred- 
icates in  the  sense  of   LA  3  or  5.     It  might  be  thought  that  (6)    is 
too  restrictive  or  not  necessary.       Our   motivation  in  stipulating 
(6)  is  mainly  that  it  secures  that  answers  can  be  implied  by  mes- 
sages;   our  model  of  communication  is  thereby  considerably  sim- 
plified.    But    (6)    can  be  justified  on  other  grounds  as  well.        For 
example,    if    R    asks  a  question  and    S     replies    with   a   name    or 
predicate,     R    must  still  construe  the    name    or   predicate    as    an 
abbreviation  of  a  statement,    for  it  is  only  a  statement   which   is 
true  and  hence  usable. 

We  accept  the  above  paragraphs    (1)    to    (6)    as    conditions    of 
adequacy  on  our  analysis.     How   far    the    analysis    fulfills    these 
conditions  is  discussed  later.        We    do   not   claim  that   (1)    to  (6) 
constitute    the   best    set   of   conditions    of   adequacy,    or    that   the 
theory  presented  below  is    the    only  theory  which   fulfills    them. 


•No  matter  how  we  finally  analyze  questions,  we  know  this  for 
sure:    Questions  are  not  statements'   —  so  say  various  philoso- 
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phers  and  defenders  of   ordinary  language    and   common   sense.1 
The  theory  we  shall  present  below  is  based  on  the  opposite   view; 
we  shall  construe  questions  as  statements  and  regard  the  logic  of 
questions  as  already  existing  within  the  logic  of  statements.  The 
theory  will  be  presented  in  detail  in  later  sections;      this    section 
is  devoted  to  rough  preliminary  explanations. 

Our  basic  idea  is  that  particular  types  of  question  can  be  iden- 
tified with  particular  types  of  statement.        "Whether"   questions 
can  be  identified  with  statements  which  behave  like  exclusive  dis- 
junctions —  that  is,    statements  of  the  form   'F    or  else    G'  ;    the 
disjuncts    F    and    G    can  be  taken  to  be  the  answers.         "Which  - 
ones"   questions  can  be  identified  with  existential  generalizations 

—  that  is,  statements  of  the  form  'Some  things  are  F1  ;  for  an 
answer  to  such  a  question  we  can  take  a  list  of  things  which  are 
F  ,  plus  a  statement  that  nothing  else  is  F  .  It  might  turn  out 
that  this  proposal  is  a  fruitful  foundation  for  other  parts  of  the 
theory  of  questions,  but  the  primary  and  guiding  intention  here 
is  only  that  this  proposal  serve  to  reconstruct  in  a  satisfactory 
way  the    relevant   concepts    of  the   question-and-answer  process. 

This  identification  of  questions   with   statements   is   the   basic 
notion  in  our  syntactics  of  questions  —  that  is,  it  is  the  basic  no- 
tion in  the  syntactical  part  of  our  semiotic  of  questions.  The  basic 
notion  in  our  pragmatics  of  questions  is  a  reinterpretation  of  the 
question-and-answer  process.  In  this   reinterpretation,    the 

question-and-answer  situation  is  construed  not  as  a  sequence   of 
requests  and  satisfactions  but  as  a  game  played  with   statements 

—  specifically,  an  information- matching  game.  Information-seek- 
ing questions  are  regarded  as  tools  for  reducing  states  of  rational 
doubt.     The  questioner  expresses   what  he   knows    concerning   the 
given  subject  matter  by  saying  'This  or  that  is  the  case'  or  'Some 
things  are  so  and  so';     and   the    respondent   then,     if  he    chooses, 
matches  this  with  a   counter  statement  which  provides   more   in- 
formation and  thereby  lessens  the  questioner's  doubt.    He  can  say 
'This  rather   than   that  is  the  case'    or  'The  things  which  are    so 
and  so  are    a    and    b'  .       Given  an  answer,  the  questioner  can  then 
proceed  with  a  further  expression,  and  the  respondent   can  match 
it  with  further  information.      This  schema  fits  not  only  the  infor- 
mation-seeking situation  but  also  the  situation  in  which  the  ques- 
tioner knows    the  answer  and  is    being  rhetorical  or  is  testing  or 
playing  with  the  respondent.    In  these  latter  cases  the  questioner 
articulates  not  his  own  ignorance  but  a  state  of  ignorance  in  gen- 
eral.   The  respondent  may  have  any  of  several  motivations.      He 
may  be  motivated  by  desire  to  help  the  questioner,    or    desire  to 
reveal  or    "show  off"   his  own  knowledge,  or  desire  to  create   a 
witty  interchange,  etc.    The  interchange  will  continue  as  long   as 
both  parties  remain  sufficiently  motivated  by  their  respective  desires. 

1  E.g.,  Hospers  [1953],  p.  69;    Wittgenstein  [1956],  p.  49;    Hamblin  [1958]. 
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Since  questions  are  statements,  they  can  be  true  or  false.  In 
the  formal  developments  below  we  shall  pay  particular  attention 
to  questions  which  are  true.  The  great  advantage  of  true  ques- 
tions is  that  they  cannot  be  challenged  as  improper.  A  true  ques- 
tion does  not  commit  the  fallacy  of  many  questions,  and  it  cannot 
be  meaningless.  Furthermore,  the  questioner  can  always  find 
a  true  question;  that  is,  if  he  is  interested  in  a  statement  F,  but 
does  not  know  whether  F  is  true,  he  can  construct  a  safe  ques- 
tion about    F,   namely,    the  true  statement   (F  v  -F).2 

We  must  assume  that  the  respondent  has  a  criterion  by  which 
he  can  know  when  he  is  playing  the  information-matching  game.  Ex- 
actly what  this  game  is,   how  it  is  to  be  distinguished  from  other 
language  games,   and  how  one  person  can  tell  when  another  per- 
son wants  to  play  the  game   with  him     are   problems   we   bypass 
here.     These  problems  are  part  of  a  large  class   of  closely  re- 
lated problems  belonging  to  pragmatics.     We  need  not   consider 
it  a  shortcoming  of  the  present  analysis  that  it  does  not  deal  with 
these  problems;    after  all,   there  are  analogous  problems  regard- 
ing the    "inference  game".     Logic  —  that  is,   the  logic   of   state- 
ments —  tells  us  about  implication,   but  neither    logic   nor    any 
other  instrument  yet  developed  tells  us  which  implications   of  a 
given  sentence  are  or  ought  to  be  entertained  by  human  minds  in 
actual  psychic  events;    nothing   yet  tells   us   how  a  human  mind 
decides,    or  ought  to  decide,   when  to  begin  inferring  consequences 
from  given  sentences.         Therefore,    if  a   critic   objects   that  the 
present  analysis    "doesn't  really  analyze   the   question   situation 
per  se",   the  reply  is  that  we   hope   to   achieve    such  an  analysis 
eventually,  but  we  are  not  hampered  by  lack  of  one   meanwhile, 
any  more  than  the  logician  is  hampered  by  lack  of  an  analysis  of 
the  inference  situation.     For  present  purposes  there  seems  to  be 
no  reason  why  we  cannot  assume  that  the  participants  in  the   in- 
formation-matching game  have    simply  decided,   from  whatever 
motivation,    to  play  the  game.     Once   the    respondent  knows   that 
he  is  playing  the  game,   he  can  recognize  questions  by   (1)   their 
grammatical  form  and   (2)   their  being  uttered  by  another  person, 
either  real  or  imaginary. 

Our  theory  requires  us  to  reinterpret  several  particular  ques- 
tions and  answers.       Some   questions    and  answers   are   uttered 
within  an  information -matching   game,   but  others   are   uttered 
about   this    game,   perhaps   within  a  higher -level   information- 
matching  game.     For  example,    the  replies    'I  don't  know',    'Pos- 
sibly' ,    and  'I  haven't  decided'    might  be  construed  not  as  answers 


2  In  this  respect  the  analysis  presented  here  generalizes  on  that  of 
Harrah  [1961].  There,  questions  are  required  to  be  true;  the  advan- 
tages of  dropping  this  requirement  were  pointed  out  to  me  by  Nuel  D. 
Belnap,    Jr. 
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to  questions  but  as  comments  about  one's  moves  in  an  information 
game.     Similarly,    some  questions  are  construed  as   formulated 
in    L,    such  as    'Will  it  thunder  again  today?'  ,   but   others   might 
have  to  be  formulated  in  a  metalanguage  of    L  .     An  example   of 
the  latter  kind  is    'Does    "thunder"  begin  with   "t"?'. 

When  we  say  that  questions  are  statements,   we  do  not  also  say 
that  asking  a  question  is  psychologically  the    same   as   making   a 
statement.     The  psychology  of  these  two  behaviors  may  be   dif- 
ferent,  but  this  is  no  objection  to  the  present  proposal.        Again 
we  cite  the  logic  of  implication.   What  emerges  as  a  publicly  com- 
municable implication  statement  may  have   begun  as    a  private 
pictorial  intuition  about  a  particular  relationship  of  containment 
or  togetherness.     In  a  similar  way,   a  questioner  might  be  moti- 
vated by  feelings   of  ignorance,    desire,    or   the   like,   but  what 
emerges  from  the   questioner   is   a  publicly   communicable   and 
logically   "hard"    entity.     It  might  even  be  argued,    on  the   basis 
of  some  of  the  results  presented  later,     that  in  a  context  of  ra- 
tional communication  questions  have  more  in  common  with  state- 
ments,   even  psychologically,   than  with  the    "private  thoughts" 
which  motivate  and  accompany  the  asking  of  the  question.3 


3  The    first    statement  of  the    foregoing   theory   of  questions  seems  to 
have   been   that  of  Harrah  [1961],   but   the   following  should  be  cited  as 
precursors : 

(1)  John   [1957],    which   identifies    questions   with   experiments; 

(2)  Hamblin   [1958],    which   postulates    that  knowing  a   question  is 
equivalent   to   knowing   what   counts  as  an   answer  and   that   the 
possible    answers  to  a  question   form   an   exhaustive  set  of  mu- 
tually exclusive  possibilities; 

(3)  Leonard   [1959],   which   analyzes    questions  as    capable  of  being 
true  or  false  in  a  derivative  sense. 

Kubinski  [1958],  which  came  to  my  attention  when  the  manuscript  of 
this  book  was  nearly  finished,  overlaps  and  thus  anticipates  my  analy- 
sis in  some  ways  but  differs  significantly  in  others.  In  particular, 
Kubinski  does  not  identify  questions  with  statements. 

Incidentally,    the    correlation   of  yes-or-no   questions    with   logically 
true  disjunctions  was  made  independently  by  Wells  [1961],   p.  242. 


7.    A    LOGIC    OF   QUESTIONS 


The  first  type  of  question  we  shall  analyze  is  the  "whether" 
question.  The  simplest  type  of  whether  question  is  the  whether - 
or-not,  or  yes-or-no  question;  this  can  be  regarded  as  a  one- 
place  whether  question,  in  contrast  to  n-place  whether  questions. 
An  example  of  a  four -place  whether  question  is  'Is  it  raining, 
hailing,  sleeting,  or  snowing?'  .  If  we  wanted  only  a  procedure 
for  asking  questions  in  a  clear  way,  we  could  take  the  one -place 
question  as  our  paradigm.  If  we  want  a  theory  of  questions  which 
is  usable  within  the  wider  framework  of  the  theory  of  communi- 
cation, however,  we  should  allow  n-place    questions    for    any     n  . 

To  facilitate    our  exposition   we  shall   use  the  following    device 
of  abbreviation:  Suppose  we  have  a  finite  sequence    Gj,   .   .  .   ,  G 
of  wffs,    where    n  >  1  .     Then  for  each    i     such  that     1  <  i  <  n,  we 
shall  use    'G-*'    to  refer  to 

((...((((...  (-Gj  &  -G2)  &...)&  -G.J  &  Gt)  &  -Gi+1)  &...)&  -GJ, 

That  is,    G-*    is  a  certain  conjunction  —  namely,   the  unique  con- 
junction of    G-    with  the  negations  of  all  the  other    G's    in  the  se- 
quence,   where    the  conjuncts    occur  in   the    same  order  as    in  the 
initial  sequence  and   conjunction  is  associated  to  the  left.         Thus 
for  example,   where    n=  4,   G*    is 

(((-G^-O,)  &G3)  &-G4). 

According  to  our  conventions  we  may  omit  parentheses  when  re- 
ferring to  G*,  but  we  must  understand  that  the  parentheses  are 
to  be  restored  in  a  unique  way.  It  should  be  kept  in  mind  that  ,*1 
occurs  in  our  metalinguistic  description  of  questions  and  answers, 
but  not  [normally]  in  the  questions  and  answers  themselves. 

Definition  7.1:  F  is  the  prime  disjunction  in  a  sequence 
G, ,  .  .  .  ,  Gn  of  statements  if  and  only  if  n  >  1  and  F  is 
( (  C  -  .  ( (Q*"  v  G2*)  v  G3*)  v  .  .  . )  v  G^*)  v  Ga*). 

According  to  our  wider  usage  of  'disjunction'  and  'disjunct'  , 
we  say  that  Q  ,  .  .  .  ,  G  *  are  the  disjuncts  of  this  disjunc- 
tion. Note  that,  by  the  logic  of  disjunction,  the  inclusive  dis- 
junction in  a  prime  disjunction  behaves  like  exclusive  disjunction. 

Theorem   7.1:   For    every    n  >1  ,    there   are    sequences 
G  ,  .  .  .  ,  Gn    of  wffs    such  that  there  exists  a  true  prime 
disjunction- in    G  ,  .  .  .  ,  G    . 

3  1 
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Proof:   Choose  an   L-true   wff  as    G    ,    and  the    closure   of 
an   L-false   wff  [-G,  ,  for  example]  as  each  of  G2,    .  .  .  ,    Gn. 

Definition  7.2:    F    is  an   n-place  disjunctive  question   if  and 
only  if    F    is  a  prime  disjunction  in  some  sequence  Gx ,  .  .  .  .G 
of  statements. 

Hereafter  we  shall  usually  omit   'n-place1   before    'disjunctive 
question'  . 

Definition  7.3:   A  direct  answer  to  a  disjunctive  question    q 
is  a  disjunct  of    q. 

The  foregoing  definitions  secure  that  each  sequence  of  state- 
ments generates  exactly  one  disjunctive  question,  and  that  a  dis- 
junctive question  has  at  most  one  true  direct  answer.  A  disjunc- 
tive question  F  says  that  exactly  one  G:  is  true;  a  direct  an- 
swer to  F  says  both  that  G-  is  true  ari~d  that  all  the  other  G's 
are  false. 

Next  we  analyze    "which"   questions.  These  differ  from  whether 
questions  in  that  whether  questions  ask  about  a  small  or  at  least 
finite  set  of  possibilities,   whereas  which  questions   ask  about  an 
infinite  or  indefinite  or  relatively  large  set  of  possibilities.  Ques- 
tions like    'Which  of  Webster,    Lily,    and  Jonson  do   you  prefer?' 
need  not  be  construed  as  which  questions,   for   they   can  be   ren- 
dered as  whether  questions.        Proper   which  questions    include 
questions  like  the  following: 

Which  plays  did  Marlowe  write? 

Who  [that  is,  which  person]  wrote  Bacon's  essays? 

When  [that  is,  at  which  time]  did  Shakespeare  write  Othello? 

How  many  [that  is,  which  number  of]  lines  did  Milton  write? 

Because  the  set  of  answers  to  a  whether  question  is  relatively 
small  and  the  set  of  answers  to  a  which  question  is  relatively 
large  or  indefinite,  the  psychology  of  answering  questions  of  one 
type  differs  from  that  of  the  other.  Psychologically,  questions 
of  the  first  kind  are  "multiple -choice"  questions,1  and  questions 
of  the  second  kind  are    "fill  in  the  blank"    questions. 

Theorem  7.2:    For  some  existential  generalizations     F,  the 
quantified   function  of    F    contains    free    variables,  and     F 
is  true. 

Proof:  By  LA  7,    some    f(a    .  .  .  a^)    is  true.  By  LA    15  (2), 
[(*)  -i(xa2-  •  •  an)  ->  -  ^(ax  .  .  .  aQ)]    is   provable,    so 
[--f(a    .  .  .  a   )  -»>~'(x)  -  f(xa2  .  .  .  a.n)]    is  provable  also,    as 
is    [f(a.2  .  .  .  a"n)  ->  (Ex)f(xa2  .  .  .  a. J]. 

Definition  7.4:   G    is  a  complete  list  with  respect  to    FXj  .   .  .  xn 
if   and   only   if    n  >  0  ;     x    ,     .  .  .  ,    x       are  distinct   variables- 


1  This  was  pointed  out  to  me  by  Nuel  D.  Belnap,  Jr. 
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occurring  free  in    F;    no  other  variables  occur  free  in     F; 

and    G    is  the  statement    (x  )  .  .  .  (xn)[Fx    .  .  .  xn  < — > 

['(x,  ="k,  &.  .  .  &  xn=   aa)  v  .  .  .  v  (x~=  bj  &.  .  .  gTxn=  b^)]  ] 

where 

(1)  a    ,     .  .  .  ,    an,     .  .  .  ,    b  ,     .  .  .  ,    bn    are   individual 
constants  as  specified  by  LA  3,    and 

(2)  there  are    m,   where    m  >  0  ,    disjuncts    (x    =  c,    & 

•  ■  •  &  ^n=  £n)- 
Note  that    Fx    .  .  .  xn     can  have  more  than  one    complete  list, 
not  only  one  for  each  number  of  disjuncts,   but  several  for    each 
number  of  disjuncts.     If    G    and    H    are  complete  lists  with   re- 
spect to    Fx.  .  .  .  xn   having  the  same  number  of  disjuncts,  G   and 
H    can   differ    in   their  sequences  of  constants  but  not  in   the    ar- 
rangement of  their  parentheses. 

Theorem  7.3:   A  complete  list  with  respect  to     Fx    .   .  .  xn  is 
L-equivalent  to 

£aj.  .  .  aa&.  .  .  &  Fbj.  .  .  &£  &  (x2)  .  .  .  (xs)  [[-(*!  =  ^  & 
.  .  .&xs=aa)&.  .  .&  -(Xj  =  bi  &.  .  .  &  Xn  =  kn)]    "♦ 
-  Fxx  .  .  .  xj  . 

This  is  a  theorem  of  the  logic  of  identity.     According    to   this 
theorem,    a  complete  list  with  respect  to    Fx   .  .  .  xn    asserts 
that   (1)    certain  things   satisfy  the  condition    F    and    (2)  no  other 
things  satisfy    F  . 

Definition  7.5:     F    is    a   which   question   if   and   only   if     F    is 
an   existential   generalization  of   some    G     with   respect  to 
x    ,  .  .  .  ,  xn,    where    x    ,     .   .  .  ,    xn    are  distinct  variables  oc- 
curring free  in    G  ,    n  >  0  ,    no  other  variables  occur  free    in 
G,    and    G    is  not    (Ev)H. 

Without   the   final    clause    in   the    above    definition,    wffs    like 
(Ex)(Ey)Fxy  would  be  ambiguous;    see  Example  (5)  in  Section  8. 

Definition  7.6:    H    is  a  direct  answer  to  a  which   question    F 
if  and  only  if    F~is     (Ex  )  .   .  .  (Exn)G  ,  where    G    is  not    (Ev)F' 
and    H    is  a  complete  list  with  respect  to    G  . 

To  refer  to  questions    more    generally  we   adopt   the    following 
two  definitions : 

Definition  7.7:    q    is  a  question  if  and  only  if    q    is    either    a 
disjunctive  question  or  a  which  question. 

Definition  7.8:     d    is  a  direct  answer  if  and  only  if    d     is    a 
direct  answer  to  a  question. 

Hereafter  we  shall  frequently  use    fq'    or    'q.'     to  refer  to  ques- 
tions,   and  {df     or    !d(q.)f    or    'd.T     to    refer    to  "direct  answers    to 
q  ..     Whenever  we  use-  'd1  ,    it  "should  be  clear  from  the    context 
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whether  we  are  using    'd'    to  refer  to  a  direct  answer  or  to  one  of 
the  individual  constants  specified  by   LA  3. 

The  above  two  definitions  are  intended  solely  to  simplify  the 
exposition  in  this  book.  Whenever  further  types  of  question  are 
analyzed,  we  may  add  them  to  the  present  analysis  and  revise 
Definition  7.7  accordingly. 

Definition  7.9:    We  use  the  question  mark  as  follows: 
(1)    If    F    is    the    disjunctive    question   in   the    sequence 
G  ,  .  .  .  ,  Gn,    we  shall  refer  to   F    by   »(G  .  .  .  Qn?)' 
or    'G  .  .  .  CL?f. 


—l 


— n 


(2)  We  shall  also  use    '(G?)'    or    'G?'    to  refer  to  the   dis- 
junctive question  in  the  sequence    G  , -G  . 

(3)  If    F    is  a  which  question,    with    G    its  quantified  func- 
tion and    x   ,     .  .  .  ,    xn    its  quantified  variables,    we 
shall   refer    to     F    by  ~?(GXj.   .  .  xn)»    or    >?Gx    .   .  .  xn' . 

This  definition  is  quite  important  for  the  interpretation  of  our 
approach  to  the  analysis  of  questions.  According  to  our  analysis, 
the  question  mark  is  not  needed  in  the  question-and-answer  proc- 
ess. According  to  the  above  definition,  the  question  mark  may 
be  used  on  the  metalinguistic  level  as  a  device  of  abbreviation. 
It  is  a  convenience,   but  a  dispensable  convenience.2 

Definition  7.10:  F  commits  the  fallacy  of  many  questions  if 
and  only  if  F  is  a  false  prime  disjunction  in  some  sequence 
of  statements  or    F    is  a  false  existential  generalization. 

In  justification  of  this  definition  we  offer    Theorem   7.6    (2)    in 
this  section  and  example  (1)  in  the  following  section. 


Theorem  7.4:    If    F    is  a  statement,    then    F?    is  an  effec- 
tively constructible  and  effectively  recognizable    question, 
F?    is  effectively  recognizable  as  being    L-true,    and      F 
is     L-equivalent  to  a  direct  answer  to    F?. 

The  latter  result    should  please   those   who   view  intellectual 
history  as  a  dialectical,    question-and-answer  process.3 

Theorem  7.5:  For  every  question  q,  every  direct  answer 
to  q  is  effectively  constructible  and  effectively  recogniz- 
able as  being  a  direct  answer  to    q  . 

Proof:  This  comes  via  the  fact  that  disjuncts  and  complete 
lists  can  be  effectively  constructed  from  the  grammatical 
forms  of  the  questions  to   which  they  are   direct  answers. 


2  In   this    respect   the   theory  presented  here    contrasts    sharply  with 
many  other    analyses    of   questions.        See,    e.g.  ,   Kubinski   [1958];    Hiz 
[1961];    Steinmann  [1959];    and  Freudenthal  [I960],   p.  20. 

3  E.g.,    Collingwood  [1940],    and  Ritchie  [1948],  Chap.  VIII. 
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Theorem  7.6:    Qeustion-asking  is  safe,  in  the  sense  that: 

(1)  No  question  is  meaningless. 

(2)  If  a  question    q    is  true,    then    q    does  not    commit  the 
fallacy  of  many  questions,    and    q    is    consistent   with 
any  true  wff. 

(3)  F?     does  not  commit   the    fallacy   of  many   questions, 
and    F?     is  consistent  with  any  true  wff. 

Proof:   See    LA  13  and  18  . 

We  interpret  Theorems  7.4,    7.5,    and  7.6  as  combining  to  guar- 
antee that  to  every  rational  interest  there  corresponds  a  rational 
question. 

The  next  theorem  lists   some     L-truths  which  have  little  inter- 
est by  themselves  and  have  little  importance  for  the  developments 
below.     The  point  of  listing  these    L-truths    is    that  they  reveal 
some  contrasts  between  our  theory  and  some    alternative    treat- 
ments of  questions.4    Incidentally,  the  same  sort  of  role  is  played 
by  Theorem  7.15. 

Theorem  7.7:   Where    F    and    G    are  statements,    the  fol- 
lowing are    L-true: 

(1)  (F?   &T G?)  *-+   [(F  ><   -F)  &  (G  ><   -G)] 

(2)  (F  &  G)?   *-->  [(F  &  G)  X   -(F  &  G)] 

(3)  (F  &  G?)   *-^   [F  &  (G  ><   -G)] 

(4)  -(F?)  «-*    (F  &  -F) 

(5)  (-F)?   <-*    (-F  ><   --F) 

(6)  (F?  v  G?)   <--+   [(F  ><   -F)  v  (G  ><   -G)] 

(7)  (F  v  G)?   «—>   [(F  v  G)  ><   -(F  v  G)] 

(8)  (F?  ■*  G?)   *-*   [(F  ><   -F)  ->  (G  ><   -G)] 

(9)  (F-G)?   — >   [(F-*G)><   -(F->G)] 

(10)  (F?  ->  G)  «-->   [(F  ><   -F)  ■*  G] 

(11)  (F  ->  G?)   <-->    [F  ->  (G  ><   -G)] 

(12)  (Fj.  •   •  F.F.  .  .   .  Fa?)    —  (Fr   .   .  FjFi  .   .   .  F^?) 

(13)  F?   <~ >   G?  " 

(14)  F?  *-->   F?? 

(15)  F  ->   G  ? 

Theorem  7.8:   It  might  be  the  case  that  not  all  questions   are 
L-equivalent,  and  that  not  all  questions  are  true. 


4  See,  e.g.  ,    Castaneda  [1957],   and  Kubinski  [1958], 


36  communication:     a    logical    model 

Theorem  7.9:  If  F  is  a  question,  then  -F  is  not  a  ques- 
tion;   more  generally,   not  all  wffs  are  questions. 

Theorem  7.10:    d(q)    L-implies    q  .5 

Proof:   For  the  case  of  disjunctive  questions:   d(q)    L-implies 
[d(q)  v  F]  ,   in  particular   where    F    is    a   disjunction  of  the 
other  direct  answers  to    q  .  For  the  case  of  which  questions, 
see  Theorem  7.3. 

Theorem  7.11:   Consistency  of    d(q  )    implies  consistency  of 
[d(q  )  &  q  ]    but   not    consistency   of     [d(q  )    &  q    &  q   ]  . 
Proof:    Consistency  of    [d(q)  &  q]     holds    because    of  Theorem 
7.10.     For  a  case  where    [d(q  )  &  q    &  q   ]     is  inconsistent,  let 
q     be    (F*  v  G*)  ,    q     be    (F*~v  H*)  ,  and    d(q  )    be    F*.  where 
H    is    L-true,    G    is  true  but  not   L-true,    and   F    is    false  but 
not    L -false. 

Theorem  7.12:  Every  n-place  disjunctive  question  has  ex- 
actly n  direct  answers.  Every  which  question  has  denu- 
merably  many  direct  answers. 

Proof:    This  follows  from  the  definitions  involved,    and  the 
fact  that    L    has  denumerably  many  individual  constants. 

Theorem  7.13:   It  might   be   the    case   that    some   question   is 
true  but  has  no  consistent  direct  answers,   or  has  consistent 
direct  answers  but  no  true  direct  answers. 
Proof:    If    Fx    .  .  .  xn    is  satisfiable  in  an  infinite   domain 
but  not  in  any  finite  domain,     ?Fx    .  .  .  x„    has  no  consistent 
direct  answer.    For  an  example,      take    Fv    as 

(Eu)(Gvu)  &  (w)[(Ex)(Gxw)  ->  (Ex)(Gwx)] 

&:  (u)(w)(x)[(Guw  &;  Gwx)  ->  Gux]  &:  (wJ(-Gww)  . 

If  the  domain  D  contains  infinitely  many  things  which  satisfy 
Fx,  then  ?Fx  has  many  false  answers  —  each  giving  a  finite 
list  —  but  no  true  direct  answer. 

Theorem  7.14:  A  sufficient  condition  for  a  which  question  to 
have  at  least  one  false  direct  answer  is  that,  for  some  pair 
of  constants  a  and  b  as  specified  by  LA  3,  (a  ^  b)  is  true. 
Proof;  Consider  the  which  question  (Ex)Fx.  Direct  answers 
to  this  include  (x)fFx  < — >  (x  =  a)]  and  (x)|"Fx  < — >  (x  =  b)].  If 
we  assume  that  both  of  these  answers  are  true,  we  may  infer 
(a  =  b)  ,    contradicting  our  hypothesis. 

Theorem  7.15:    It  might  not  be  the  case  that: 

(1)    If    d    is  a  direct  answer  to     F?,  then    d    L_-implies    a 
direct  answer  to     ( F  v  G)  ? 


5  This  theorem  was   pointed   out   to   me   by   Belnap.        It   corresponds 
closely  to  Definition  III  2-5  of  Kubinski  [1958]. 
Pointed  out  to  me  by  David  Kaplan. 
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(2)  If    H    does  not    L-imply  a  direct  answer  to     F?,    then 
H    does  not    L_-imply  a  direct  answer  to     (F  &  G)? 

(3)  If    d    is  a  direct  answer  to    F?    and   F?    L-implies   G?, 
then    d    L-implies  a  direct  answer  to    G  ? . 

Proof:     For    (1),    let    d    be     (-F)*.     For  (2),    let    H    be     -G. 
In   (3),    G?    is    L-implied  by  every    q  . 

The  next  theorem  is  important  for  the  theory  of  test  questions. 
We  cannot  define    'test  question1    until  later,   but  we  can    say  now 
that  a  test  question    q    is  a  question  such  that,  inter  alia,  the  ques- 
tioner knows  a  true  direct  answer  to    q  .      Trouble  arises  when   q 
has  more  true  direct  answers  than  the  questioner   has   listed   on 
his  answer  sheet.     The  questioner  then  needs  to  know  under  what 
conditions  he  can  ask  a  test  question  and  reject  any  reply  which 
is  not  L-equivalent  to  the  answers  on  his  answer   sheet.  For  true 
disjunctive  questions  the  matter  is  simple,   because  at  most  one 
direct  answer  can  be  true.      For  which  questions  we  have: 

Theorem  7.16:    Let    d,    and    d     both  be  true  direct  answers 

—1  —2 

to  a  true  which  question    q.     Suppose  that,  for  every  pair  of 

individual  constants     a    and    b    occurring  in  the  disjuncts  in 

d      and    d    ,    if    (a  =  b)     is  true  it  is    L-true.     Then    d     and   d 

are    L-equivalent. 

Proof:    q    is     (Ex  )  .  .   .  (Exn)Fx    .  .  .  xn.  We    shall   lose 

nothing  in   generality  if  we- let    n  =  1.      Then  d     and   d      are 

(1)  (x)[£x  *-*    (x  =  ax  v  .  .  .  v  x  =  &m)] 

(2)  (x)[Fx«    (x  =  b,  v.   .   .  v   x  =  b^)]. 

Now  suppose  that  some    a_£    and    a      in   (1)    are    such  that 
(a_£  =  a„)    is  true;    then  (a.7  =    ag)   ~~  is    L-true   and  we    can 
drop      ~" (x  =  a   )     from    (1)7     obtaining    (l')     which   is    L- 
equivalent  to  -(1).    Thus  we  shall  lose  nothing  in  generality 
if  we  suppose  that   (1)   has  already  been  "condensed"  so  that 
no  such  identity  of    afs     is  true.     Analogously  we    may  sup- 
pose that    (2)   has  been  so  condensed.      Theorem  7.3  assures 
us  that   (1)    L-implies 

(3)  Fai  &.  .  .  &  Fam, 

so  that  the  conjunction  of   (1)    and   (2)    L-implies 

(4)  (ax  =  bj  v  .  .  .  v  a1  =  b^)  &  .  .  .  &  (a^=    b i  v  .  .  .  v  a^  =  b ^) 
which  can  be  converted   into 

(3)    (ai=  bi  &.  .  .  &  am=  bi)v...v(ai=bk&...&  ^m  =  ^) 
Since    (d      &    d2)    is  true,    at  least  one   of  the    disjuncts    of 
(5)    is  true;  let  us  consider 

(6)    (a1=  b_i&.  .  .  &  3^=  bj). 
Since  these    m    identities  are"true,   they  are    L-true,  so  (6) 
is    L-true.     Now  suppose  there  are  more  disjuncts    in    (1) 
than  in   (2).     If  this  is  so,    then  even  if    b- ,     .   .   .  ,    b^     are 
distinct,    two  or  more  of    a,     .  .  .  ,     a        will   be    identified 
with  some  one    b  ,    so  some    (a.  =  a   )      is  true,  contradicting 
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our  assumption  that    (1)    had  been  condensed.   A  similar  line 
of  reasoning  can  show  that  there  cannot  be  more  disjuncts  in 
(2)   than  in   (1),    and  so    (1)    and   (2)   have  the  same  number  of 
disjuncts.     Also,    a    ,     .  .  .  ,    a_m    are  distinct,   and   so   are 
b-,     .  .  .  ,    b-    in   (6).  HenceTTsetween  the  disjuncts  in  (1) 

and  (2)  there  is  a  one-one  correspondence  such  that  corre- 
sponding disjuncts  are  L- equivalent,  from  which  it  follows 
that    d      and    d      are    L-equivalent. 


8.   A   LOGIC    OF  ANSWERS 


We  next  proceed  to  lay  the  foundations  for  a  quantitative  analy- 
sis of  replies.  Since  R  wants  to  pay  for  a  message  only  to  the 
degree  to  which  the  message  answers  his  question,  he  must  be 
able  to  distinguish  different  degrees  of  answering.  Our  analysis 
of  these  matters  is  presented  in  two  stages.  First,  in  this  sec- 
tion we  distinguish  partial  answers.  Second,  in  a  later  section 
we  shall  show  how  quantitative  functions,  particularly  the  infor- 
mation function  I ,  can  be  applied  in  a  reasonable  way  to  sets  of 
direct  and  partial  answers. 

Note  that   it  is  only   the    direct  and   partial   answers  which   are 
relevant  to  our  analysis  of  communication.      It   might  be    thought 
that  we  should  be  interested  in  complete  answers,  where  a  com- 
plete answer,    roughly,    is  a  message    which    L-implies  a   direct 
answer.      But  if    R    were  required   to  pay   for  complete  answers, 
over  and  beyond   the  direct  answers    which  they  imply,    he   would 
usually  have  to  pay  too  much,    in  the  sense  that  he  would  have  to 
pay  for  parts  of   the  message   which  did  not  interest  him.       It   is 
easy   enough   to    define    complete    answers,1    but   here   we    shall 
ignore  them. 

Definition  8.1:  F  is  a  partial  answer  to  a  question  q  if 
and  only  if  there  is  some  consistent  direct  answer  d  to  q 
such  that    d    L-implies     F. 

This  definition  is  perhaps  the  most  liberal  definition  we    could 
reasonably  formulate.2     Of  course,     d    must  be    consistent,    for 
otherwise    a   question   with   inconsistent   direct   answers    would 
have    every  wff  as    a  partial   answer,    including   all   inconsistent 
wffs,   so  that    R    would  have  to  pay  too  much.    It  might  be  thought 
that  we  should  exclude    all   L-true   wffs   from  being  partial   an- 
swers,  because  if    q    has    any   consistent   direct   answer    it   has 
every   L-true  wff  as  a  partial  answer.       We    can  tolerate    L-true 
partials,   however,    for  on   the   payment    system   to   be   presented 
later     R    does  not  have  to  pay   anything  for  an   L-true    wff.         We 
must  allow   L-true  partials,    furthermore,    if  we    are  to  allow    S 
to  give  partial  answers  in  reply  to   test   questions   with   L-true 


1  See,  e.g.  ,  D10  of  Harrah  [1961].  Belnap  has  suggested  that  a  com- 
plete answer  should  be  allowed  to  imply  even  an  L-true  direct  answer, 
since  the  burden  is  on    S    to  show  that  the  direct  answer    is    L-true. 

2  This  definition  was  suggested  by  Belnap;  it  seems  to  be  more  suit- 
able for  the  needs  of  this  book  than,  say,  Dll  of  Harrah  [1961]. 
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direct  answers.  Another  possible  objection  to  Definition  8.1  is 
that  some  direct  answers  are  also  partial  answers,  which  seems 
intuitively  unsatisfying.  But  the  direct  answers  to  q  are  not 
always  included  in  the  partial  answers  to  q  ,  and  we  can  always 
speak  of  'partial  answers  to  q  which  are  not  direct  answers  to 
qf  ,  and  in  any  case  the  payment  system  to  be  presented  later 
can  accommodate  any  overlapping  of   direct  and  partial  answers. 

Theorem  8.1:    Some  questions  answer  themselves,  as  follows: 

(1)  No  question  is  a  direct  answer  to  itself. 

(2)  If  a  question    q    has    any   consistent   direct   answers, 
then    q    is  a  partial  answer  to  itself. 

(3)  Some  questions    L-imply  direct  answers  to  themselves. 
Proof:   Part  (1)  holds  because  of  the  grammatical  forms  in- 
volved.    Part    (2)    holds  because  of  Theorem  7.10.  Concern- 
ing Part  (3):    FG?    L-implies     F*    if  and  only  if  G    L-implies 
F;3       this    can  be    shown   as    follows;     First,     [(G  -*  F) 

&  [(F  &   -G)  v  (G  &   -F)]]     L-implies  (F  &   -G).     Second,     if 
[  [(F  &  -G)  v  (G  &  -F)]   -*  (F  &  -G)]   is  provable,  so  is  (G ->  F). 

Theorem  8.2:   It  might  be  the  case  that  some   questions  have 
no  partial  answers,   but  if  a  question  has  any  partial  answers 
it  has  infinitely  many. 

Proof:   Some  questions  might  have  no  consistent   direct  an- 
swers.    But  if    F    is  a  partial  answer,    so  is    (F  &  F) . 

Theorem  8.3:   An   L-false  statement  can  be  a  direct  answer 
to  a  question,  but  it  cannot  be  a  partial  answer. 
Proof:   See  Theorem  5.4  (4). 

Definition  8.2:   A  question    q      answer -contains  a  question   q 
relative  to  a  wff    F    if  and   only  if  every    d_x    is    such  that 
(F  &  d  )    L-implies  some    d    . 

Definition  8.3:    Two  questions  are  answer -equivalent   rela- 
tive  to  a  wff    F    if  and  only  if  each  answer-contains  the  other 
relative  to     F  . 

These  definitions  are  intended  to  specify  a  concept  which  R 
can  use  for  economizing  in  his  question-asking.  The  F  param- 
eter is  introduced  for  the  sake  of  generality;  in  communication 
situations  it  might  be  R's  knowledge  k.  If  R  knows  that  some 
d.  is  true  and  that  q  answer -contains  q  relative  to  k,  then 
he  will  ask    q     but  not    q   ,   because  every    d     will  "include"  some 

d   ,    and  if    k    is  true  then  the  true    d     will  include  a  true    d    .   It 

—z  —  —l  —2 

is  because  of  this  intended  application  that   we    can  tolerate   the 

liberality  of  Definition  8.2.     Where  no    d      is  such  that    (F  &c  d  )  is 


3  This  was  pointed  out  to  me  by  Belnap, 
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consistent,  q  answer -contains  every  q  relative  to  F  ,  but  we 
can  tolerate  This  situation  in  our  theory  because  R  can  avoid  it 
in  practice;  R  can  so  choose  his  k  and  his  questions  that  trivial 
cases  of  this  sort  do  not  arise. 

Theorem  8.4:  Answer-containment  relative  to  F  is  reflexive 
and  transitive.  Answer-equivalence  relative  to  F  is  reflex- 
ive,   symmetric,    and  transitive. 

Proof:    For  every    d ,     (F  &  d  )     L-implies     d    .  If   every 

(F  &  d  )     L-implies  some     d      and  that   d_2  is  such  that   (F  &  d_2) 
L-implies    some     d    ,    then  every    (F  &  d  )     L-implies  some   d3. 

Theorem  8.5:     F?     and   (-F)?     do  not  have  the   same  direct  an- 
swers,  but  they  are  answer-equivalent  relative  to  any  wff     G  . 
Proof:    These  questions  are    [F*  v  (-F)*]    and    [(-F)*  v  (--F)*], 
so  each    d      L-implies  a    d    ,    and  each    d      L-implies   a     d   . 

One  might  think  that  'Is  it  the  case  that  .  .  .  ?'  and  'Is  it  not 
the  case  that.  .  .  ?'  are  in  some  sense  the  "same"  question,  and 
that  questions  therefore  have  no  logic,  or  have  a  logic  different 
from  that  of  statements.5  Theorem  8.5  shows  that  this  is  not  the 
case,  for  it  shows  that  F?  and  (-F?)  are  equivalent  in  a  rea- 
sonable sense,  even  where  these  questions  are  construed  as  statements. 

%     >;<     >;«     >;<     t^ 

We  shall  now  consider  a  number  of  examples.    For  the  sake  of 
smoothness  of  exposition  our  discussion  will  be    relatively  in- 
formal.    We  shall  assume,    in  effect,    that    L    is  a  reconstructed 
version  of  English,    and  we  shall  occasionally  use    'or  else'    in 
place  of   'v!  . 

(1)    Consider    'Have  you  stopped  beating  your  wife?'.      If  this 
is  construed  as    'You  have  a  wife  and  have  beaten  her  and 
now  I  want  to  know  whether  you  have  stopped'  ,  then  it   is 
asked,  according  to  our  analysis,  by  asserting   '(You  have 
a  wife  and  have  beaten  her  and  have  stopped)*     or    else 
(You  have   a  wife   and  have  beaten  her    and  have   not 
stopped)*'.     If   "you"  does  not  have  a  wife,  or  has  one  but 
has   not  beaten   her,    then   this    disjunction   is    false    and 
commits  the  fallacy  of  many  questions.     The  safe  way  to 
ask  about  wife -beating  is  to  assert   '(You  have  a  wife  and 


4  Definition  8.2  is  a  generalization  and  improvement  on  D18  of  Harrah 
[1961];  I  am  grateful  to  Belnap  for  suggestions  and  proddings  in  this  regard. 

5  This  is  roughly  the  position  of  Prior  [1955],  A.  N.  Prior  has  written 
me,  however,  that  he  has  become  less  skeptical  and  has  done  some  work 
toward  a  calculus  of  questions. 

Note  that  it  would  not  be  reasonable  to  expect     ?Fx  and    ?-Fx    to   be 
answer -equivalent;    cf.  Fries  [1952],   pp.  166-172. 
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you  have  beaten  her  and  you  have  stopped)*  or  else  (It  is 
not  the  case  that  you  have  a  wife  and  that  you  have  beaten 
her  and  that  you  have  stopped)*1.     The  direct  answers  to 
this  latter  question  are  equivalent   to   the   assertion  and 
the  denial  of  the  three -component  conjunction.         Direct 
answers  to  this  question  are    L-implied  by  'I  don't   have 
a  wife1,    'I  have  a  wife  but  haven't  beaten  her',  and  so  on. 

(2)  Suppose  'If  you  wear  a  coat,   will  you  wear  a  hat?1  is  ren- 
dered by  '(If  you  wear  a  coat  then  you  will  wear   a  hat)* 
or  else    (It  is  not  the  case  that  if  you  wear   a   coat  then 
you  will  wear  a  hat)**.      To  this  question  a    'yes'    answer, 
meaning    'If  I  wear  a  coat  then  I    shall  wear    a  hat'  ,    L- 
implies  a  direct  answer;    but  a    'no'    answer,  meaning  'If 
I  wear  a  coat  then  I  shall  not  wear    a  hat'  ,    is   a  partial 
answer  but  does  not    L- imply  any  direct  answer.  The 
reason  for  this  is  that     -(F  ->  G)     L- implies     (F  ->  -G) , 
while  the  converse  does  not  hold.      This  asymmetry  cor- 
responds to  a  difference  in  the  content  of  the  two  answers. 
Whereas    'no'    gives    at  best   a   necessary   condition   for 
wearing  a  hat  —  namely,   not  wearing  a  coat  —  'yes' 
gives  a  sufficient  condition,   which   is   what   the   question 
calls  for. 

(3)  Consider  (a)  'Which  of  Van  Gogh's  ears  was  cut  off?' 
and  (b)  'Was  Van  Gogh's  right  ear  cut  off?1  Suppose  a 
denotes  Van  Gogh's  right  ear,  b  denotes  Van  Gogh's 
left  ear,  F  expresses  the  property  of  being  an  ear  of 
Van  Gogh,  and  G  expresses  the  property  of  being  am- 
putated from  Van  Gogh,  so  that  these  questions  may  be 
rendered  by 

(a)  (Ex)  (Fx  &  Gx) 

(b)  Ga*  v  (-Ga)* 

Now  suppose  the  reply   'Just  Van  Gogh's  left  ear  was  cut 
off    is  given;    let  us  render  this  as 

(c)  (x)[(Fx&Gx)<-^x=b] 

Then   (c)    is  a  direct  answer  to    (a).  It   L-implies  a  direct 
answer  to    (b)    if  there   is    an   Lj- truth  guaranteeing   that 
Van  Gogh's  right  and  left  ears  were  distinct,    or  else   an 
L-truth  guaranteeing  that  his  ears  were  identical,  that  is, 
that  he  had  only  one  ear.     Suppose  the  reply   'Van  Gogh's 
right  and  left  ears  were  both  cut  off    is    given;     let   us 
render  this  as 

(d)  Ga  &  Gb 

Then   (d)    is  not  a  direct  answer  to  either    (a)    or    (b),   but 
it    L-implies  a  direct  answer  to    (b)    and  is  a  partial  an- 
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swer    to     (a)  .       It    L-implies    a   direct    answer    to     (a)     if 
there    is    an   L-truth  guaranteeing  that  Van  Gogh   had   no 
ears  other  than  his  right  and  left  ones. 
(4)    'If  the  reserve  has  disappeared,   then  whose  savings   are 
intact?'    might  be  rendered  as     [F  ->  (Ex)Gx]?.    If  so,  and 
if    [E  ~*  (E2.)Gx]*    is  given  as  a  direct  answer,   the   next 
question  might  be     F?.     If    F*    is  given  as    a    direct   an- 
swer to  this  second  question,    the  third  question  would  be 
(Ex)Gx  .      The    reply    'Morgan's  savings  are  intact'     L- 
implies  a  direct  answer  to    the    first   question  but   not   to 
the  third,   because  it  implies  that  someone's  savings  are 
intact  but  fails  to  provide  a  complete  list.  This  difference 
corresponds    to    a   difference    in   relative    information. 
'Morgan's  savings  are  intact'   conveys  more  information 
relative  to  the  original  disjunction,    which  is  a  tautology 
with  zero  information,    than  it  does  relative  to    the    final 
which  question,    which  is  a  nontautology  with  positive  in- 
formation. 

A  different  way  of  rendering    'If  the  reserve  has  disap- 
peared, then  whose  savings  are  intact?1  is  via   [F  ik  (Ex)Gx]?. 
This  question  conveys  as  much  information  as    the    first 
question,    namely,    none;    but  the  information   content   of 
their  direct  answers  might  differ.       The    direct   answers 
to  the  first  are   L- equivalent  to 

[-F  v  (Ex)Gx]    and    [F  &  -(Ex)Gx] 

while  those  of  the  second  are    L-equivalent  to 

[F  &  (Ex)Gx]    and    [-F  v  -(Ex)Gx]  . 

Incidentally,    when    R    asserts,  for  example,     [(Ex)Fx]  ? 
and    S    replies     (Ex)F  ,    R    is  thereby  confronted  with  what 
we  have  defined  to  be  a  question.    He  might  know  the  cor- 
rect  answer    and  proceed   to    give    it;     this    could  be    the 
case  where  he  asked  the  original  disjunctive   question  to 
test    S  .     On  the  other  hand  he  might  not  know  the  answer; 
in  this  case  he  is  free,    according  to  our  analysis,    to  re- 
peat   S's    reply,    so  that    S '  s    reply  now  becomes  a  ques- 
tion put  to    S  . 
(5)    Assuming  that  some    of   the   boys    are   brothers    of   each 
other,    'Which  boys  are  brothers  of  each  other?'  is  ren- 
dered by   (Ex)(Ew)(Fx  &  Fw  &  Gxw)  .     'Wilmer    and    Zane 
are  brothers  of  each  other,  and  no  other  boys  are  brothers 
of  each  other'    is  a  direct  answer.    'Wilmer  and  Zane  are 
brothers  of  each  other'    is    a  partial   answer,    and   so    is 
'Wilmer  is  somebody's  brother'  .  If  the    registration 

clerk  is  in  a  hurry  and  doesn't  want  to  be  swamped   with 
details  of  who  matches  up  with  whom,   but  wants  to   know 
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only  which  boys  are  used  to  close  companionship,  he   asks 
not    (Ex)(Ew)(Fx  &  Fw  &  Gxw),   but  rather    (Ex)  --  (Ew)(Fx 
&  Fw  &  Gwx)    or    (Ex)[Fx  &  (Ew)(Fw  &  Gxw)]     or    some 
analogous  variant.     The  direct   answers   to   these   latter 
questions  are  lists  of  names,    rather  than  lists    of  pairs 
of  names. 

(6)  'What   are   the   prime   numbers?'     can  be    rendered  by 
(Ex)(Fx  &  Gx)  .     If  it  is  so  rendered,   no   direct   answer 
to  it  is  true,    because  there    are  infinitely   many  primes. 
It  does  have  infinitely  many   true  partial   answers,   how- 
ever,   such   as    'Two    is    a   prime    number',     'Three    is    a 
prime    number',     'Two    and   three    are    prime   numbers'. 

(7)  Consider    'Will  you  wear  a  coat?'  ,    'Will  you  wear  a  coat 
or  hat?1  ,    'Will  you  wear    a   coat  and  hat?'  ,    and   'If  you 
wear  a  coat,   will  you  wear  a  hat?'  .     Suppose  these,  re- 
spectively,   are  rendered  as 

F*  v  (-F)* 

(F  vG)*  v  [-(F  v  G)]* 

(F  &  G)*  v  [-(F  &  G)]* 

(F  -  G)*  v  [-(F  ->  G)]* 

Let    H    be  the  conjunction  of  all  these  questions.        Then 
none  of  these  questions  answer -contains  any  of  the  others 
relative  to    H  . 


9.   QUESTION   SETS 


For  developing  our  model  of  communication  it  is  most  conveni- 
ent to  regard    R!s     questions  as  composing  a    set   which   is    non- 
empty but  finite.    Even  with  only  finitely  many  questions,     R    can 
use  which   questions  to  ask   about  infinitely  many   things.        What 
our  analysis  prevents  him  from  doing  is  being    "interested  in"  — 
that  is,    asking    questions    formulated   in   terms    of  —  infinitely 
many  wf f s .    He  need  not  feel  hampered  by  this  restriction,   since 
we  allow   him  to   be  interested  in  an  indefinitely   large  number  of 
wffs,    and  to  extend  any  question  set  as  far  as  he  wants.    For  the 
logic    of  question    sets    we    can   define    concepts    and   establish 
theorems  analogous  to  those  for  the  logic  of  individual  questions. 
We  shall  present  here  only  a  few. 

Definition  9.1 :   A  proper  question    set   is    a    set  whose   only 
members  are    n    questions,   where     0  <  n  <  oo  .   A  true  proper 
question  set  is  a  proper  question  set  all  of  whose   members 
are  true. 

Hereafter  we  shall  usually  say  'question  set'  in  place  of  'proper 
question  set',    and  we  shall  use    'Q-'     to   refer   to   a  question   set. 
We  shall  use    'CCh'    to  refer  to  the  alphabetically  first  conjunction 
of  all  the  questions  in    Q^    in  case    Q^    has  two  or  more  members, 
and  to    q    in  case    Q?    consists  just  of~q.    Because  a  set  is  com- 
pletely determined  by  its  members,  a  question  set  does  not  con- 
tain duplications.    Nevertheless  a  question  set  might  be  redundant 
in  fairly  obvious  ways;  it  might  contain,  for  example,   both     FG? 
and    GF? 

Definition  9.2:   A  direct  answer  combination   DA   of  a  ques- 
tion set    Q    is 

(1)  a  direct  answer  to    q    in  case    Q   consists  just  of    q,and 

(2)  a  conjunction  whose  n  conjuncts  include  exactly  one 
direct  answer  to  each  q  in  Q ,  and  nothing  else,  in 
case     Q    consists  of    n    questions,    where    n>  1. 

We  shall  use    'DA-'     to  refer  to    direct   answer    combinations  of 
Q-  .     In  effect,  a    DA"  of    Q    answers  every  question  in    Q. 

Theorem  9.1:   Every    Q    has  at  least  two     DA's. 
Proof;   See  Theorem  7.12. 

Theorem  9.2:   Some    Q' s  have     DA's    which   are    L-true. 
Further,    it  might  be  the  case  that  some     Q' s    have    two    or 
more  different    L-true     DA's. 
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Proof:  Q  might  consist  of  just  (Ex)Fx  ,  and  it  might  be  L- 
true  that    F    is  true  of    a    and  nothing  else,    and   that    a  =  b  . 

Theorem  9.3:  A  sufficient  condition  for  every  Q  to  have  at 
least  one  DA  which  is  not  true  is  that,  for  some  constants 
a    and    b,    (a  =#  b )    is  true. 

Proof:  A  disjunctive  question  has  at  least  two  direct  answers 
of  which  at  most  one  can  be  true.  For  the  case  of  which 
questions,    see  Theorem  7.14. 

Theorem  9.4:   Some     DA1  s    are  not  consistent,  and  it   might 
be  the  case  that  some   Q's    have  no  consistent    DA. 
Proof:   See  Theorem  7.13. 

Theorem  9.5:    DA    L- implies    CQ  . 
Proof:   See  Theorem  7.10. 

Theorem  9.6:    (DA  &  CQ)    might  not  be  consistent,   but   con- 
sistency of    DA    implies  consistency  of   (DA  &  CQ).  Further, 
if    Q    is  true,    then    CQ    is  consistent. 
Proof:   See  Theorems~9.4,    9.5,    and  7.6. 

Theorem  9.7:   If    Q    consists  only  of    F?,     .  .  .  ,    Fn?,   then 
there  is  a    DA    of    Q    such   that    (DA  &  CQ)    is    consistent. 
Proof:   Each~Fi?     is"  L-true,    so    (F~?  &7T.  &  Fn?   &  d    &.  .  .  & 
dn)    is    L-equivalent  to    (d    &.  .  .  &  dn).       Here  ~d^    is    L- 
equivalent  to  either     F^    or     -F^  .  We  want  to  prove  that  some 
such  conjunction  of    d's    is  consistent.        But   it    suffices    to 
show  that    [(F   &.  .  .  &  Fn)  v  .  .  .  v  (-F  & .  .  .  &  -Fn)]    is  con- 
sistent,   and  the  consistency  of  this  disjunction  follows  from 
the  fact  that  the  disjunction  is    L-equivalent    to    [(F  v  -F.) 
&.  .  .  &  (Fn  v  -Fn)],   which  is    L-true. 

This  theorem  is  quite  important.        It   says,    in   effect,   that  if 
one  confines  himself  to  questions  of  the  form    F?  ,    then  he   al- 
ways  has   the    right   to   expect   a   reply  which  is    consistent  and 
which  completely  answers  his  questions. 

Definition  9.3:    A  question  set    Q     answer -contains  a   ques- 
tion set    Q      relative  to  a  wff    F    if  and  only  if  every  CQ     is 
such  that    (F  &  CQ  )    L-implies  some    CQ    . 

Definition  9.4:    Two  question  sets  are  answer-equivalent  rel- 
ative   to   a  wff    F    if  and  only  if  each  answer -contains   the 
other  relative  to    F  . 

Theorem  9.8:   Answer -containment  relative  to    F    is    reflex- 
ive and  transitive.     Answer -equivalence  relative  to     F     is 
reflexive,    symmetric,    and  transitive. 
Proof:   See  proof  of  Theorem  8.4. 

In  communication  situations    R    wants    to  be   able   to   evaluate 
the  completeness  of  a  question  set.     Intuitively  it  seems  that  the 
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degree  of  completeness  of    Q    is  the  degree  to  which  the  questions 
in    Q    are    "about"   the  subject  matter  one  is  interested  in.    Com- 
pleteness,   then,    is  a  function  of  both  questions  and  subject  matter. 
We  can  regard  subject  matters  as  sets  of  wffs  and  then   take   the 
completeness  of    Q    to  be    a   function   of   the    degree    to   which   the 
DA's    of    Q    are  about  the  wffs  constituting  the  subject  matter.    Is 
there  a  satisfactory  measure  of   "aboutness"  which  could  be  used 
here?    Some  effort  has  been  made  to  achieve  one,1   but   we    shall 
not  assume  here  that  the  problem  has  been  solved  in  a    satisfac- 
tory way.     What  we  do  assume  is  that  any  measure  of  aboutness 
applicable  to  all  wffs  of    L    will  be  applicable  in  particular  to  di- 
rect answer  combinations  and  subject  matters.       Though  waiting 
for  a  satisfactory  quantitative  concept  of  aboutness  to  be  developed, 
we  do  have  available  a  qualitative  analysis    of   completeness,    as 
follows : 

Definition  9.5:    A  state-description  in  a  set  of  wffs  is  a  wff 
which   L-implies,  for  every  wff    F    in  the  set,  either    F  or  -F. 

This  definition  is  much  more  liberal  than  the  usual  definitions 
of   •  state  -de  s  cription' }    For  example,  if    F    is    L-  false,    then    F 
is  a  state-description  in  every  set  of  wffs;    and  every  set  of  wffs 
has  infinitely  many  state-descriptions.     Thus  our  notion  of  state- 
description  might  appear  trivial,   or  at  least  fruitless    for   quan- 
titative purposes.     It  will  prove  very  convenient,  however,  to  use 
the  notion  in  just  this  form.     In  our  model  of  communication   we 
want  to  make  life  as  easy  as  possible    for    an  honest    R,    and  as 
rough  as  possible  for  a  deceitful    S  ;    in  particular  we  want    R   to 
be  able  to  construct  complete  question  sets  easily.     If    R    wants 
to  check  on  the  completeness  of    Q ,  he   will  have    to    investigate 
whether  various  wffs  happen  to  be  state-descriptions,    and   Defi- 
nition 9.5  is  intended  to  spare    R    the  additional  labor  of  investi- 
gating the  consistency  of  all  these  wffs.     A  further  advantage    of 
our  definition   is    that   it   helps    provide    further    possibilities    for 
trapping  a  deceitful  or  incompetent    S  .    For  example,    R    can  con- 
struct question  sets  which  are  complete,  but  cannot  be  answered 
completely  in  a  consistent  way.     Possibly    S    would  be    aware    of 
this  and  be  able  to  avoid  the  trap,   but  he  could  not  detect  merely 
from  this  whether    R    was  being  sly  or  stupid. 

Definition  9.5  poses  no  technical  problems  for  the  logical  de- 
velopment of  our  analysis;  it  actually  simplifies  it,  for  there 
are  never  any  doubts  about  the  existence  of  state-descriptions. 
Also,  the  infinity  of  state-descriptions  need  never  be  an  embar- 
rassment; we  can  always  cut  down  the  number  by  imposing  suit- 
able restrictions:  We  can  specify  'consistent  state-descriptions' 
or  'the  alphabetically  first  state-description',    etc. 


1  Putnam  [1958]    and  Goodman  [1961]. 

2  E.g.,    Carnap  [1950],   Sec.  18. 
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Theorem  9.9:   Every  finite  set  of  wffs  has  a  consistent  state- 
description. 

Proof:  For  each  wff  F  in  the  set,  form  the  disjunction 
(F  v  -F)  .  Combine  all  such  disjunctions  into  [(F  v  -F  ) 
&.  .  .  &  (Fn  v  -Fn)],  and  convert  the  latter  into  [(Fx  &:.  .  .  & 
Fn)  v  .  .  .  v  (-F  £"...&  -Fn)]  ■  Each  disjunct  in  this  latter 
disjunction  is  a  state -description,  and  since  the  closure  of 
this  disjunction  is  L-true  at  least  one  of  its  disjuncts  must 
be  consistent. 

Definition  9.6:   Q    is  complete  for  a  set    W    of  wffs,  relative 
to  a  wff    F ,    if  and  only  if  every    DA    of    Q    is    such  that 
(F  &  DA)    is  a  state -description  in    W. 

Thus  if    Q    is  complete  for    W    and    R    receives  a    DA    of    Q, 
then,   with  the  aid  of    F  ,   R    can  know  how  things  stand  in  the 
little  world  of    W. 

Theorem  9.10:   If    W    is    a  finite    set  of  statements    F  , 
.  .  .  ,    Fn,    then  the  set  consisting  of    F  ?,     .  .  .  ,    Fn?     is 
complete" for    W    relative  to  any  wff. 
Proof:   See  proof  of  Theorem  9.7. 

As  noted  above,  some  question  sets  are  redundant.  The  set  in- 
cluding F?  and  FG?  is  redundant,  and  seems  more  redundant 
than  the  set  whose  only  member  is  FG?.  A  redundant  Q  is  a 
costly  Q  ,  but  not  all  redundancy  is  waste;  R  might  find  redun- 
dancy useful  in  overcoming  "noise"  or  trapping  a  deceitful  S  . 
Can  we  measure  the  amount  of  redundancy  in  Q?  All  we  claim 
here  is  that  any  measure  of  redundancy  developed  for  wffs  can 
be  applied  in  particular  to  direct  answer  combinations. 

The  analysis  of  communication  in  this  book  makes  only  one  de- 
mand on  our  theory  of  redundancy.  In  some  of  the  procedures  by 
which    R    evaluates  the  performance  of    S   he  must  be  able  to  al- 
low for   and  neutralize  any   amount  of  redundancy  in    Q  .    Fortu- 
nately there  is  a  way  of  doing  this  which  does  not  require  a  meas- 
ure of  redundancy.     For  example,    suppose  we  want   to   measure 
the  amount  of  information  in    Q    where    Q    is  redundant.  We  can- 
not take    I(Q)    to  be  the  sum  of  the    I    values  of  the    q's    in     Q  , 
where    I(q)    is  a  function  of  the    I    values  of  the    d's    to    q,    but 
we  might  take    I(Q)    to  be  a  function  of  the    I    values  of  the    con- 
sistent   DA's  of    Q.     In  this  way  we  bypass  the  problem  of  redun- 
dancy completely  —  we  let  the  logical  structure  of  the  DA's  ab- 
sorb whatever  redundancy    Q    has. 

In  communication  situations,  what  kind  of  a  question  set  should 
R    choose?      How  much   redundancy   should   it   contain?       When 
should  he  augment    Q?    Which  questions  in    Q    should    R  actually 
communicate  to    S,  and  which  ones    should  he   merely   "have   in 
mind"?    In  what  order  should  he  arrange  the  questions  which  he 
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actually  puts  to    S?     It  seems  clear   that  these   matters    should 
be  decided  on  the  basis  of  such  considerations  as  the  complete- 
ness of    Q,    the  reliability  of    S,    the  risk  involved  in  acting  on 
a  false    DA,    the  good-will  of    S    toward    R,   and  so  on.     A  de- 
tailed exploration  of  these  matters  belongs  to  a  later    stage   of 
our  inquiry. 


10.   ADEQUACY    OF   OUR   THEORY 


How  adequate  is  our  theory  of  questions?  Does  it,  in  the  first 
place,    satisfy  the  conditions  of  adequacy  adopted  in  Section  6? 

Condition  (1),  concerning  the  construction  of  rational  questions, 
is  satisfied;    this  is   shown  by  Theorems  7.4,    7.5,    and  7.6.     Con- 
dition (2),    concerning  the  recognition  of  questions,    is    satisfied; 
questions  are   defined  solely  by  reference  to  grammatical  forms 
which  are  effectively  recognizable.   Condition  (3),  concerning  the 
existence  of  answers,    is  satisfied   because  of  our   definitions    of 
'question'    and   'direct  answer' .     Condition  (4) ,    concerning  suffi- 
cient answers,    is    satisfied  because  of  our  definitions  of  'direct 
answer'    and   'partial  answer',    and   Theorem  7.5.     Condition  (5), 
concerning    containment  and   equivalence,    is  satisfied;    see  Defi- 
nitions 8.2  and  8.3,   and  Theorem   8.4.       Condition  (6),    requiring 
that  answers    can  be   L-implied  by  statements,   is    satisfied  be- 
cause of  our  characterization  of  L-implication  and  our  definitions 
of  'direct  answer'    and   'partial  answer' . 

Is  our  theory  of  questions  adequate  for  our  model  of  communi- 
cation? In  satisfying  conditions  (1)  to  (6),  we  have  gone  a  long 
way  toward  an  adequate  theory,  and  we  shall  consider  that  our 
theory  as  developed  thus  far  is  an  adequate  basis  for  the  develop- 
ments of  Sections  12  to  16.  Nevertheless,  we  must  acknowledge 
that  our  theory  of  questions  needs  further  development  before  we 
can  claim  for  it  any  sort  of  completeness. 

Among  the  many   types  of  question  which   we  have  not  yet  ana- 
lyzed are    "how"   and   "why"   questions.       It   seems   plausible   to 
analyze  these  as  questions  about  expressions  of    L,    asked   in  a 
metalanguage  of    L.    Thus    'Why  is  it  the  case  that.  .  .  ?'    can  be 

construed  as    'Which  sentences  of   L   are  good  reasons  for ?', 

and  'How  does  .  .  .  ?'    can  be  construed  as    'Which  sentences   of 

L    are  good  descriptions  of ?'.     Other  examples  of  questions 

which  must  be  asked  in  ML   are  'Is    F    true?'    and   'Does   q  have 
any  consistent  answers?'.       We   might   even  be  forced  to   ascend 
from  the  metalinguistic   level  to   the   metametalinguistic   level. 
For  example,    'What  is  the  meaning  of  .  .  .  ?'    might  have   to  be- 
construed  —  in  MML  —  as    'Which  sentences  of  ML   are   rules 
of  interpretation  of.   .   .  ?'. 

In  addition  to  construing  such  questions  as  metalinguistic,  we 
might  have  to  construe  many  types  of  reply  as  assertions  of  a 
higher  level.     Consider  the  following  conversation: 

50 
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(1)  'Is  Throop  a  Democrat?1 

(2)  'Throop  isn't  the  sort  who  really  commits  himself.' 

(3)  'But  does  he  usually  vote  Democratic?' 

(4)  'It  wouldn't  be  for  the  party;    it  would  be  for  the  man.' 

(5)  'Then  has  he  voted  for  men  who  were  Democrats?' 

(6)  'Usually  he's  not  voting  for  a  man  but  against  someone  else.' 

A  comprehensive    theory   of   communication  would   construe    (2), 
(4),    and  (6)  here  as    "illuminations"1    of  (1),    (3),    and  (5),  or  per- 
haps as  comments   made,    at  a  metalinguistic   level,   about  the 
meaning  or  value  of  the  direct  answers  to  (1),    (3),   and  (5).        To 
develop  our  theory  so  that  it  can  cover  these  matters,  we  should 
formalize   ML  ,    develop  for  it  a  theory  of  questions    like    the    one 
we  have  developed  for    L  ,   and  develop  much  additional  apparatus 
as  well,    so  that    R    and    S     can  communicate    in  ML    concerning 
their  questions  and  answers  in    L.     All  of  this    further    develop- 
ment belongs  to  a  later  stage  of  our  inquiry. 

Another  type  of  question  which  we  have  not  yet  analyzed   is  the 
"trap"  question.  Suppose,  for  example,  that   R   asks  [F*  v  [(Ex)Gx]*], 
that    (Ex)Gx    has  the  form  of  a  question  but   R   knows  that   (Ex)Gx 
is  false,    and  that    S    gives     (Ex)Gx    as  his  reply.     In  this    situa- 
tion   R    might  want  to  throw    (Ex)Gx    back  at    S    as  a  question  in 
order  to  test  or  challenge  or  trap    S  . 

In  connection  with  trap  questions  there  are  at  least  two   types 
of  problem  which  we  have  not  yet  explored.  One   problem  is 

whether  we  want  to  include  as  trap  questions  some  wffs  which  do 
not  have  the  grammatical  form  of  questions   —  a  trivial  example, 
which  might  find  possible  use  as  a  device  for  testing  the  respond- 
ent's alertness  or  knowledge,    is  the  wff   (Ex)F,    where    F    has  no 
free  variables  and  is  true.     A  more  important  problem  is  that  of 
justifying  the  use  of  questions  which  are  false.        If     R    wants    to 
ask  a  false  question  in  a  particular  context,    he  should  be  able  to 
justify  asking  it  in  that  context;    if   he    cannot  justify   the    asking, 
then    S    is  entitled  to  penalize  him.       We  might  say   that   a   false 
question    F    is  justified  for    R    in  a  communication   event   E^    in 
a  sequence     E    ,   .  .   .  ,  En     of  communication  events  if  and  only  if 
F    is     L-implied  by  the  "conjunction   of  messages    sent  by    S     in 
communication  events  preceding    E-     in  the  sequence.     Further, 
a  false  question  which  is  justified  atone  point  in  the  communica- 
tion sequence  can  subsequently  lose  its  justification  if  new  mes- 
sages imply  its  negation.       An   additional   complication  is    that   a 
false  question   F   might  be  inconsistent  with   R' s  knowledge   k  ,  so 
that    R    cannot  conjoin   F    with    k    as  an  aid  in  making  deductions. 
Thus,  in  general,   R    can  use* false  questions  only  in  isolation,   as 
goads  to  provoke    S    into  making  worse  blunders.    In  any  case  we 
shall  do  nothing  more  with  trap  questions  in  this  book. 


1  In  perhaps  a  more  general  sense  than  that  of  D1Z  of  Harrah   [1961], 
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What  is  the  likelihood  that  this  theory  can  be  developed  to  be- 
come adequate  not  only  for  our  model  of  communication  but  also 
for  the  entire  field  of  questions?  To  answer  this,  one  should  first 
clarify  'entire  field  of  questions'  and  stipulate  criteria  for  ade- 
quacy. We  shall  attempt  neither  the  clarification  nor  the  stipu- 
lation here,   but  will  make  one  claim  and  several  conjectures. 

We  conjecture  first  that  the  best  method  for  developing   a   de- 
scriptive model  of  natural  question  behavior  is  the  Ajdukiewicz- 
Hiz  method.2    This  method  involves  the  use  of  a   "questioner"   of 
the  form   'For  what    x'.     A  typical  question  is    'For  what    x,   x 
read  the  book  to  Peter';    its  answers  are  the  result  of  replacing 
'  x'    in   'x    read  the  book  to  Peter'    by  phrases  of  the  proper  syn- 
tactical category.     Assuming  that  the  notion  of  proper  syntactical 
category  can  be  made  precise,    this   method   seems   to   yield  a 
simple  analysis  of  a  great  variety  of  questions  as  found  in  ordi- 
nary language. 

Our  second  conjecture    is   that  the   best  method  for    analyzing 
problem-solving  and  information-seeking  programs  is  the  Medvedev- 
Stahl-Hamblin  method.3    This  method  is,    essentially,   to   identify 
questions  with  their  answers    or   with  the   means   for    answering 
them.     The  great  advantage  of  this  method  is  that  it  invites  direct 
application  of  advanced  methods,   such  as   those   of  the   theory  of 
recursive   functions  or  the  theory  of  selective  information. 

Our  claim  concerns  the  relative  strength  of  our  theory  of  ques- 
tions for  analyzing  sender-receiver  interactions.  We  do  not  claim 
that  the  particular  results  presented  in  this  book  are   definitive,5 
but  we  do  claim  that,   for  analyzing  sender -receiver  interactions, 
the  general  approach  argued  for  in  this  book  has  certain  definite 

2  See  Hiz   [1961]. 

3  Medvedev   [1955];    Stahl  [1956],  p.  74  fn;    and  Hamblin  [1958].        See 
also  Hick  [1951],  p.  160,    and  Naess  [1953],  p.  120. 

4  Belnap  has  pointed  out  that  even  on  the  theory  adopted  in  this   book 
we    can   get   the    effect   of  the    set-of-answers    theory,    in   the   following 
sense:    Taking    'question'    in  the  sense  of  this  book,    and   assuming   that 
L.    is  sufficiently  rich,    then:    Given  a  question    q  ,    the  set  of  answers  to 
q    is  effectively  constructible  from    q;    and  given  a  set  of  answers,    a 
question  can  be  constructed  to  match  it. 

5  Belnap  is  currently  developing  an  analysis  of  questions  along  some- 
what the  same  lines  but  with  significant  differences  in  detail.     Belnap1  s 
theory  advances  on  the  one  presented  here  in  several  ways;  e.g.  ,  it  se- 
cures a  straightforward  analysis  of   'What  is  an  example  of .  .  .  ?'    ques- 
tions.    See  Belnap  [1962]. 

It  remains  to  be  seen  what  the  relative  advantages  are  between  the 
analysis  of  this  book  and  the  analysis  of  Kubinski  [1958].  These  two 
analyses  differ  in  some  details  of  internal  development,  and  in  many 
details  of  interpretation.  Kubinski  differs  from  us,  e.g.  ,  in  treating  'Is 
it  the  case  that  p  ,  and  is  it  the  case  that  q?1  as  a  single  question,  to 
which    -p    counts  as  an  indirect  but  complete  answer. 
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advantages  over  other  approaches.     One  advantage  of  our  theory 
is  that   answers  —  both  direct  and  partial  —  can  be    implied  by 
messages.       More  generally  and  more  important,    both  questions 
and  answers  are  statements  and  thus  can  have  information  values, 
so  that  the  analytical  power  of  the  theory  of  semantic  information 
can  be  applied   to  question-and-answer   situations.      In  support  of 
our    claim   we  offer    Sections   14   to    16.     Concerning  the  adequacy 
of   our    theory   as    a   descriptive   model,    we    offer    the   following 
quasi-conjecture :    Perhaps    it  might   be   possible  to    formulate  a 
reasonable  set  of  conditions  of  adequacy  for  descriptive  theories 
of  natural   question  behavior,    such   that   we    could    construct   an 
adequate  descriptive  theory  by  combining  the  method  of  this  book 
with  the  Medvedev-Stahl-Hamblin  method.      Using  this  combina- 
tion, we  might  define  questions  as  ordered  pairs,    the  first  mem- 
ber of  each  being   a   question  in  the    sense   of  this   book  and  the 
second  being  the  set  of  direct  answers  in  the   sense  of  this    book. 
Working  out  the  formal  details,  and  examining  the  empirical  data, 
must  be  left  to  the  future. 


11.   INFORMATION 


Now  that  we   are   in  a  position  to    specify    L    and    Q    for   our 
model,    the  next  step  is  to  choose  an  information  function    I.    We 
want    I(p)    to  represent  the  amount  of  information  conveyed  by    p  . 
In  particular,    if    p      L -implies    p    ,    then    p      says    at  least   as 
much  as    p      and  we  want    I(p  )    to  be  at  least  as  great  as    I(p  )  . 

Will  any  of  the  information  theories  available  today  satisfy  R's 
needs?  So  far,  two  major  types  of  information  theory  have  been 
developed.1  The  first  is  the  theory  of  selective  information,  which 
came  into  prominence  with  the  work  of  Shannon.  This  theory  was 
developed  to  answer  questions  about  the  capacity  of  a  communi- 
cation channel  to  transmit  information;  it  construes  information 
as  a  statistical   concept,    and  its    central   concept  is   average  in- 


formation^.      Roughly,    the   average   information   in  the  messages 
being  considered   is  the  average  surprise -value  of  the  messages, 
the  average  surprise   of  the  receiver  at   receiving  the  messages. 
The  degree  of  average  surprise  is  computed  from  empirical  prob- 
abilities, the  relative  frequencies  of  occurrence  of  the  messages. 

Unfortunately,  although  R  can  use  selective  information  func- 
tions for  some  purposes,  he  cannot  use  them  as  the  I  under  con- 
sideration here.  The  theory  of  selective  information  is  concerned 
with  large  sets  of  messages  and  is  not  intended  to  assign  infor- 
mation values  to  individual  messages;2  in  particular,  it  does  not 
guarantee  that    I(p  )  ^  I(p  )    in  case    p      L-implies    p    . 

The  second  currently  available  theory  is  the  theory  of  semantic 
information,    created  by  Kemeny,    Carnap,    and   Bar-Hillel.    This 
theory  was    developed   to   provide   a   measure   of  the    "logical 
strength"   of  individual  statements,    so   that  one   might  measure 
the    "degree  of  daring"  of  a  hypothesis  or  the  "weight"   of  a  piece 
of  evidence.      The  amount  of  semantic  information  is  determined 


1  Other  information  theories  have  been  developed,  but  for  our  purposes 
these  can  all  be  regarded  as  varieties  of  the  two  major  types.  Fisher's 
theory  of  statistical  information  has  affinities  with  the  theory  of  selec- 
tive information;  see  Goldstine  [1961]  ,  p.  1395.  The  theories  of  MacKay 
[1950]  and  Brillouin  [1956]  can  be  regarded  as  variations  or  extensions 
of  the  theory  of  selective  information;  see  Von  Foerster  [1952],  pp.  181- 
221.  For  discussions  of  information  theory  which  are  more  directly  rel- 
evant to  this  book,  see  Cherry  [1957],  Carnap  [1952a],  Kemeny  [1953], 
and   Bar-Hillel   [1955]. 

2  This  is  one  of  the  greatest  temptations  in  the  application   of  this 
theory;    for  cautions  against  it,  see  Von  Foerster    [1952],   p.  219. 
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logically  rather  than  empirically.     Roughly,    if  we    think   of  the 
simplest  statements  of  the  language  as  its    "atomic"  statements, 
then  the  amount  of  information  in  a  statement  is    determined  by 
the  number  of  atomic  statements  implied  by  the  given  statement. 

Carnap's  theory  applies  only  to  languages  in  -which  the  predi- 
cates are  independent,  one-place  predicates.3  Kemeny's  theory 
applies  only  to  languages  satisfying  the  condition  that  no  state- 
ment of  infinity  is  provable  in  the  language.  Kemeny  has  argued 
that  this  condition  imposes  no  significant  limitation  on  his  theory, 
though  at  least  one  critic  has  asserted  the  contrary.  For   the 

sake  of  generality  —  generality  in  our  treatments  of  both   L   and 
I   —  we  shall  not  commit  our  model  to  a  specification  of     I     in 
terms  of  either  Carnap's  or  Kemeny's  theory.    We  shall  not   de- 
fine   I.     Rather,   we  shall  give  a  partial  characterization  of     I 
by  way  of  a  set  of  axioms. 

Our  specific  assumptions  about    1^   are  the  following: 

(1)  At  any  given  time    t  ,    R    has  an  information  function    I  . 
At  time    t ,    I    assigns  to  any  given   statement   of     L_    a 
real  number,    which    R    believes  at    t    is  the  information 
value  of  the  statement. 

(2)  0  <  I(p)  <  oo  . 

(3)  I(p)  =  0    if  and  only  if    p    is    L-true. 

(4)  I(p)  =  oo   if  and  only  if    p    is  an   L-false  statement. 

(5)  If    p     L-implies    p   ,  then  it  is  not  the  case  that  I(p  )  <  I(p  ). 

(6)  The   information  of    p      relative  to    p      is  defined  by 

—l  —2 

3Ei  I  E2>  =  1^  &  p2>  -  I(p2)- 

Because  our  axiomatic  characterization  of  I  might  have  some 
interest  in  its  own  right,  it  is  worth  noting  that  assumptions  (2) 
to  (5)  are  theorems  in  Carnap's  theory  and  (6)  is  adopted  as  a 
definition  in  Carnap's  theory.  In  Kemeny's  theory  (2)  and  (5)  are 
theorems,  but  it  is  not  always  the  case  that  if  I(p)  =  0  ,  then  p  is 
L-true,    or  that  if    I(p)  =  oo  ,    then    p    is    L-false.5 

Before  discussing  these  assumptions  further,  we  shall  present 
a  few  theorems,  most  of  which  will  be  important  in  later  sections. 

Theorem  1 1 . 1 :   If    p     is    L-equivalent  to   p    ,  then  I(p  )  =  I(p2). 
Proof:    p      and    p     L-imply  each  other .     So,   by  assumption 
(5),     Ifp^I^fand    Itp^I^). 

Theorem  11.2:   If    p     L-implies    p   ,  then    I(p    &  p   )  =  I(p  ). 
Proof:    (p    &c  p  )    is    L-equivalent  to    p    ,     so   the  theorem 
follows  from  Theorem  11.1. 


3  Concerning  the  seriousness  of  this  restriction,    see    Carnap   [1951] 
and  Kemeny  [1952]. 

4  Kemeny   [1952],  p.    41,    and  Popper    [1959],   p.  376  fn. 

5  Carnap   [1952a],   Sec.  7,    and  Kemeny   [1953],   p.  298. 
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Theorem  11.3:   I(p j  &  p  J  >  I(p  ^  >  I(p  1  v  p  2) . 

Proof:    This  comes  via  assumption  (5) 

Theorem  11.4:   I(p    |  p   )  >  0  . 

Proof:    (p    &  p   )    L-implies    p    ,    so    I(p    &  p   )  $■•  I(p  ). 

Theorem  11.5:   If    p      L-implies    p    ,   then    I(p    |  p   )  =  I(£  )  -  I (p 
Proof:   Here,     (p    &;  p   )    is    L-equivalent  to    p    ,    so    I(p    |  p  ) 

ns7)-K£2)- _1  "2    "  -2     --i  -2 

Theorem  1 1.6:   If    p     L-implies    p    ,  then    I(p    |  p  )  >I(p    |  p  ). 
Proof:   (p    &  p3)    L-implies     (p    &  p  ),    so    I(p    &  p  )  >  I (p    &  p  ). 
Thus    IJp^fe  p3)  -  I(p3)  >I(p2  &  p_3)  -  I(P3). 

Theorem  11.7:    I[(Ei  &  p_2)  |   (£2  &  p_3)]  =  l[p    |   (p_2  &  p^]. 
Proof;    (p     &  p2&  p    &  £  )    is  L-equivalent  to    (p    &  p    &  p  ). 

so  ittPjl?  p2)"j  (g2  &2  p3)]3=  ][(£"&  p2  &  p_3)  -  i(g2  ^pJT2 

Because  the  above  six  assumptions  characterize    I    only  par- 
tially,   our  model  is  still  relatively  general  with  respect  to     I. 
We  allow    R    to  define    I    in  various  ways,   perhaps  as  a  function 
of  probability;    alternatively,    I    might  be   a   subjective   function 
whose  values  are  determined  either  wholly  or  in  part  by  subjec- 
tive judgments.      Because  of  Theorem  11.1,    I    cannot  be   a   sub- 
jective information  function  in  the  sense  of  Wells.7  Also,  because 
of  assumption  (5),    I    cannot  be  interpreted   as   a  utility  function 
in  the  sense  required  for  decision  theory.     This  can  be    seen  as 
follows:    Let    p      be    'You  will  receive  a   dollar,  and  you  will  be 
shot1    and    p      be    'You  will  receive  a  dollar'.      According  to   as- 
sumption (5),    I(p    $.  I(p    ),    and  so   [we  assume]    I    does  not  rep- 
resent the  relative  desirability  to    R    of  securing  that    p     or    p 
is  the  case. 

There  is,   however,   a   different  way  of  interpreting    I      as    a 
utility  function.        Suppose    p    signifies   a    state   of  affairs    over 
which    R    has   no    control,   and  in  particular   a    state   of  affairs 
which  has  already  occurred  but  which    R    does  not  know  has   oc- 
curred.    If  we  assume  that    R    prefers  knowing  what  is   the    case 
to  not  knowing  what  is  the  case,   then    I(p)    can  represent  the  rel- 
ative desirability  to    R    of  knowing  that    p    is    indeed   the    case. 
Informally,   at  least,   we  shall  think  of    I(p)     as    representing  the 
result  of  two  evaluations: 

(1)  an  evaluation  of  the  amount  of  f actuality  in    p  ,    or    the 
distance  between    p    and  triviality,    and 

(2)  an  evaluation  of  the  utility  of  being  told  that    p    is  the 
case,   where    p    is  indeed  the  case. 


6  Concerning  subjective  functions,  see  De  Finetti  [1958], 

7  Wells   [1961]. 
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In  regard  to  the  connection  between    I    and  his  probability  func- 
tion,   R    does  not  have  complete  freedom.     Because  of  assump- 
tion (4),   if    I(p)  =  -log  P(p)  ,    then  only  L-false  statements   have 
zero  probability,    and  some  theories  of  probability  deny  this.8  To 
preserve  what  generality  we  can,  we  shall  at  least  for  the  present 
make  no  explicit  assumptions  about  the  connection  between     I 
and  probability.     Informally,   however,   for  heuristic   purposes, 
we  shall  occasionally  speak  as  though    R    had  a  probability  func- 
tion   P,   and  as  though    P    and    I_  were  connected  at  least  to   the 
extent" that  if    P(p  )  >  P(p  )   then  J[(P  )   >  UP  )■ 

It  might  be  urged  that  assumptions  (3)   and   (4)    are   unreason- 
able,   in  that 

(3")    I(p)  =  0    only  if    p    is    L-true,    and 
(4")    I(p)  =  oo    only  if    p    is    L-false 

are  doubtful  or  undesirable.     To  defend  our  adoption   of   (3)    and 
(4)  it  will  suffice  to  discuss  (4').        The   objection  to    (4')    stems 
from  the  feeling  that    I     should  be  connected  with    P   in  a  certain 
definite  way  —  either  via  the  usual  definition  or  via  the  interpre- 
tation and  application  of    I    and    P  —  and  the  feeling  that  the  prob- 
ability analogue  of  (4''),   namely 

P(z)   =    0    only  if    z    is  impossible, 

is  unreasonable.  This  feeling  is  supported  by  cases  like  that  of 
the  pointer  on  the  dial.  We  spin  the  pointer,  and  then  it  comes  to 
rest,  pointing  at  some  position  on  the  dial.  How  many  positions 
could  it  point  to?  Nondenumerably  many.  So  the  probability  that 
it  will  come  to  rest  at  any  given  position  is  vanishingly  small  — 
though  clearly,  its  coming  to  rest  at  the  given  position  is  logi- 
cally possible. 

If  we  were  swayed  by  arguments  like  this,  we  should  allow  that 
the  information  value  of  a  sentence  could  be  infinite  even  where 
the  sentence  is  not   L-false.     But  surely  this  is  absurd  for   any 
sense  of  'information'    which  is  to  be  used  for  paying   for   mes- 
sages.    The  problem  is  not  that  we  would  have  to  pay  the    same 
amount  of  cash  for  some  consistent  messages  as  for  any  incon- 
sistent one,    for  we  could  adopt   a   rule   of  message-processing 
according  to  which  any  inconsistent  message  is  rejected  as  in- 
eligible   for  payment.     The  problem  is  rather  how  to  determine 
a   fair  payment  for  a  consistent  message    conveying   infinite   in- 
formation.    Clearly,   we  should  not  have  to  pay  infinite  cash  for 
any  message. 

One  cannot  both   (a)   hold  that  a  consistent  message  conveys 
infinite  information  and   (b)    refuse  to  pay  anything  for    such  a 


E.g.,   Chernoff  [1959],   p.  51. 
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message,  for  such  a  refusal  would  be  tantamount  to  denying  that 
the  message  conveyed  any  information  at  all.       One  might,   while 
allowing  that  consistent  messages  can  convey  infinite  information, 
try  to  formulate  a  payment  procedure  by  which  the  pay  for  such 
a  message  can  be  cut   down  to  a  reasonable   amount,  but  what 
principle  could  he  use   for   the  cutting   down?      If  he   decided  to 
evaluate  the  message   by   some  function  other   than    I  ,   he   would 
in  effect  be  admitting  that    I    is  inadequate  as  an  evaluation  func- 
tion, that  infinite  information  is  not  measurable  information,  that 
there   is  really  no  point  in  assigning  any  magnitude   whatsoever 
to  such  information. 

In  practice,    if   someone  announced   that  the  pointer   had   come 
to  rest  precisely  at  position    n  ,    n    being  one  out  of  nondenumer- 
ably  many  possible  positions,   he   would  be  regarded  as  ignorant 
or  vicious.       A  rational  man  would  announce  that  the  pointer  had 
come  to  rest  roughly  at  n,  or  in  the  interval  n±m;    or  he  would 
announce  that  his  observations  and  calculations  lead  him  to  be- 
lieve that  the  pointer  is  at  rest  at    n  .     These  forms  of  announce- 
ment are  reasonable,   but  they  alter  the  probability  and  informa- 
tion values  of  the  situation.     On  the  dial  face  there  is  only  a  finite 
number  of  rough  positions  or  intervals,    so  the  probability  of  any 
given  position  is  no  longer  vanishingly  small,  and  the  information 
of  any  given  pointer  report  need  not  be  considered  infinite. 

Because  of  all  these  considerations,  we  do  not  foresee  any  sit- 
uation in  which  a  rational  R  need  regard  a  consistent  message 
as  conveying  infinite  information.  We  might  have  chosen  not  to 
commit  ourselves  on  this  point,  not  to  assume  (4').  Our  reason 
for  assuming  (4)  and  thereby  (4")  is  that  this  assumption  is  very 
useful  in  the  analysis  below. 

We  shall  assume  informally  that    I    is  a   "face  value"   function, 
a  semantic  function  which  assigns  an  information  value  to  a  state- 
ment according  to  the  content  of  the  statement.     As  such  I    con- 
trasts with  the  pragmatic  information  function  which  assigns    an 
information  value  to  a  message  according  to  both  the  probability 
that  the  message  is  true,  computed  before  the  message  is  received, 
and  the  probability  that  the  message  is  true,    computed   after    the 
message  is  received.9    We  might  say  that  such  pragmatic    infor- 
mation functions  are  double -probability  functions,    and   that   our 
semantic  function    I     is,    implicitly  at  least,    a  single -probability 
function.     As  a  basis  for  computing  cash  payments  for    messages, 
double -probability  functions    are   better    than   single -probability 
ones.     If    R    pays  cash  according  to  single -probability  functions, 
he  can  delay  payment  for  a  time,    during   which  he    can   examine 
the  world  and  cross-examine    S  ,   but  when  payday  finally   comes 


9  Goldman   [1953],    Chaps.  I  and   IX. 
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he  cannot  scale  down  the  payment  by  subtle  gradations;    he    must 
either  accept  or  reject  the  message  in  toto  and  pay  either  the  full 
amount  or  nothing.     In  the  analysis  below  we  shall  think  of  R   as 
using    I    to  make  an  analysis  of  the  ostensible  semantic  proper- 
ties of  the  message  sequence,    and  using  other  functions  for  mak- 
ing cash  payments.     We  shall  return  to  this  point  later. 

There  is  another  type  of  information  function  which  might  seem 
to  overlap    I_  in  some  respects.     This  function,   which  we   denote 
by   'V,    is  intended  to  measure    "value  of  information".  Its  cru- 
cial properties  are  the  following  three: 

(1)  0<  V(p)  ^  1. 

(2)  V(p)  =  0     if  and  only  if    p    is  either    L-true  or    L-false. 

(3)  V(p)  =   1     if  and  only  if    P(p)  =  P(-p)~  f  . 

Such  a    V    can  be  defined  in  various  ways,10  but  we    shall   regard 
V    as  sufficiently  specified  by   (1)  to  (3).        We    can  use    V     as    a 
fractional   multiplier   which  represents   the    relative   helpfulness 
of  having  information.        For    example,    consider    a   businessman 
who  can  gain  a  thousand   dollars   if  a   certain   event    E    occurs. 
What  is  the  value  to  him  of  knowing  whether  or  not   E  has  occurred; 
that  is,   how  much  should  he  be  willing  to  pay  to  find  out  for  cer- 
tain whether  or  not    E    has  occurred?       Clearly  this   depends   on 
how  great  his  doubt  is  about    E  .       If  he    is    completely  uncertain 
about    E  —  that  is,   if  the  probability  that    E    has    occurred  is   \ 
—  then  the  value  of  knowing   for    sure   about     E     is    greater   than 
the  value  of  knowing  in  the  case  where  he  is  less  uncertain  about 
E  .     If  the  probability  is  \  ,    then    V  =  1,  so  the  businessman  ought 
to  be  willing  to  pay  up  to  a  thousand  dollars  for  knowledge  that   E 
has  in  fact  occurred. 

Since  we  can  interpret    V    as  an  information-value   function, 
the  question  arises,    'Could    R    use     V    in  place  of    I?1.         There 
are,    of  course,    important  logical  differences;    e.g.  ,    if    p    is    L- 
false,    then  _I(p)  =£  V(p)  .     A  graph  showing    V    values  does  not  dif- 
ferentiate between    L-truths  and    L-falsehoods,   as  does  a  graph 
showing    I_  values.     Questions  about  the  overlapping  of    I    and    V 
arise  when  we  interpret    I     as  measuring  amount  of   information, 
interpret    V    as  measuring  reduction  of  uncertainty,  and  construe 
amount  of  information  as  amount  of  reduction  of  uncertainty.  Now 
we  can  take    V    as  measuring  relative  distance  from  uncertainty, 
or  amount  of  reduction  of  doubt,   but  we  need  not  interpret     I(p) 
as  the  amount  by  which    p    reduces  uncertainty.   We  may  instead 
interpret    I  ,    as  indicated  above,    in  terms  of  the  amount  of  fac- 
tuality  in    p.     In  our  model  we  can  allow    R    to  use  both    V     and 
I,    V    for  calculating  the  cash  value  of  making  inquiries,    and     I 
for  assessing  the  semantic  importance  of  messages.11 

10  See  Marschak  [1954],  especially  p.  205. 

11  Cf.  the  interpretation  given  by  Marschak   [1954],   pp.  204  f. 
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Finally  let  us  note  some  connections  between    I    and  our  theory 
of  questions : 

Theorem  11.8:    UF?)  =  _I[(-F)?]  =_I(G?)  =  0  . 

Theorem  11.9:   It  might  not  be  the  case  that  J(?Fx   .  .  .  x    )  =  0 
or  that  _I(F  .  .  .  Fn?)  =  0  . 
Proof:   See    Theorem  5.1. 

Theorem  11.10:   If    q    is  a  question  and    d    is  a  direct  answer 
to    q,    then  _I(d)  ^l(q)  . 
Proof:   See  Theorem  7.10. 

Theorem  11.11:   If    p    is  a  partial  answer  to    q      then  there 
is  some  direct  answer    d    to    q    such  that    1(d)  ^.  I(p)  . 
Proof:   See  Definition  8.1. 


12.    COMMUNICATION   EVENTS 


In  preceding  sections  we  specified  the  essential  components  of 
R's  mind  —  L,  k,  Q,  and  I.  The  next  step  is  to  put  them  to- 
gether. Out  of  them  we  shall  construct  various  communication 
situations  which  are  important  to    R  . 

Definition  12.1:   A   communication   event   for    R    at   time    t 
is  an  ordered  sextuple     <m,    L,    k,    Q,    I,    i>     such  that 

(1)  L    satisfies  the  conditions  LA  1  to  18  specified  in  Sec- 
tions 4  and  5; 

(2)  m    is  a  statement  of    L; 

(3)  k    is  a  consistent  statement  of    L  ; 

(4)  I    is  an  information  function  satisfying  the  six  assump- 
tions specified  in  Section  11; 

(5)  Q    is  a  proper  question  set   [in  the  sense    of   Definition 
9.1],    and 

(6)  i    is  a  positive  integer. 

Definition  12.2:    The  message  in  a   communication  event    E 
for    R    at    t    is  the  first  member    of    E.       A  message  for    R 
at    t    is  a  message  in  a    communication  event  for    R    at    t  . 

We  shall  use    !m-',    '  L-  ' ,    etc.,  to  refer  to   the   first,    second, 
etc.  ,   members  of  the  communication  event    E-  ;  we  shall  use    '  q. ' 
to  refer    to  members    of    Q^  .      Occasionally,  "where   there  is  no" 
danger  of  ambiguity,  we  shall  use    'communication   event  for    R1, 
leaving  the  time  parameter  understood,  or  'communication  event', 
leaving    R    understood.    Similarly,  we  shall  use  'message  for   R' 
or    'message'.     The  function  of    i   [that  is,   the  sixth  member    of 
a  communication  event]    will  be  explained  later. 

Informally,  we  assume  that  R  uses  L  and  I  at  t ,  that  he 
believes  at  t  that  k  is  true,  that  he  asks  or  has  in  mind  the 
questions  in  Q  at  t,  and  that  he  examines  m  at  t  .  Precisely 
what  using,  believing,  etc.,  are  is  a  problem  we  bypass  here. 
It  must  be  kept  in  mind  that  R  is  an  unanalyzed  parameter,  so 
the  pair  <R,  m>  does  not  determine  a  communication  event. 
By  leaving  R  unanalyzed,  and  by  allowing  i^t,  we  allow  R 
to  participate  in  two  or  more  communication  events     E-     and    E^ 

at   the    same   time    t;      in   this    respect   our    model   fits     the  case 
where    R    is  a  complex  institution. 

Concerning    (2)    of   Definition  12.1:     If    R    receives  from    S     a 
wff    Fx    containing    x     as    a   free   variable,    he   must    construe 
this   wff  as  an  abbreviation   for    some    statement,    such  as    (x)Fx 
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or  (Ex)Fx  or  Fa  ,  and  take  this  statement  to  be  the  message. 
In  empirical  interpretations  of  our  model  we  have  considerable 
freedom  in  regard  to  messages.  If  we  want  to  apply  the  model 
to  the  situation  where  R  is  reading  a  book  written  by  S  ,  we  can 
construe  the  book  as  a  single  wff  —  a  long  conjunction  of  all  its 
sentences  —  or  else  as  a  long  sequence  of  messages,  each  sen- 
tence counting  as  a  separate  message.  It  will  be  evident  below, 
in  connection  with  our  theory  of  message-processing,  that  dif- 
ferent initial  specifications  of  the  message  can  make  great  dif- 
ferences in    R's    final  evaluation  of    S. 

It  might  be  urged  against  Definition  12.1  that  consistent  is  not 
an  effective  notion,  and  hence  communication  event  is  not  an  ef- 
fective notion,  whereas  we  want  communication  event  to  be  effec- 
tive. In  reply  we  would  argue  here  that  the  ordinary  notion  of 
communication  event  is  a  normative  and  noneffective  notion.  Con- 
sider a  usage  like  'We  were  conversing,  all  right,  but  we  were 
not  communicating'.  In  this  sense,  two  persons  can  communicate 
only  if  they  share  a  common  standard  of  meaningfulness,  truth, 
falsity,  and  perhaps  importance  as  well;  in  terms  of  our  model, 
R  and  S  can  communicate  only  if  L  and  k  are  consistent.  It 
is  only  in  this  sense  that  the  metalevel  question  'Can  we  com- 
municate?' is  interesting.  We  might  try  to  generalize  on  Defini- 
tion 12.1  and  define  an  effective  notion  of  message  event,  but  for 
the  developments  below  we  would  still  have  to  confine  our  atten- 
tion to  those  message  events  which  were  communication   events. 

The  communication  event  is  the  modular  unit  of  our  system; 
out  of  communication  events  we  shall  construct  all  of  our  other 
communication  situations.  The  simplest  such  construction  is 
given  by  the  next  definition. 

Definition  12.3:   An  interview  for     R    is  a  sequence    E  ,  .  .  .  , 
En  ,    where    n  >  0  ,    of  communication  events  for   R  ,    such  that 

(1)  if    l^i^j^n.    then 

(a)  the  sixth  member  of    Ej    is  identical  with    i  ,    and 

(b)  if    E  j    is  a  communication  event  for    R    at    tg   and 

E;   is  a  communication  event  for    R    at    tr  and    t 
— -  —  "I  ~g 

is  later  than    tr,   then    j  >  i;    and  — 

(2)  if    1   <  i  ^  n  ,    then 

(a)  L,.  =  L^  ,  1^  =  1^,   kt   L- implies    k.^,   and    Q.^ 
is"  included  in    Q.  ; 

(b)  if    ki  ±  k^,  then"  m*  =  m.       and    Qi  =  Q^  ;    and 

(c)  if    Q.  =£  Q.     ,    then    m-  =  m.       and    k^  =  k~     . 

Clause    (1-a)    in  this  definition,    and  the   analogous    clause   in 
Definition  12.5,    secure  that  in  the    sequences    of  communication 
events  which  are  relevant  to  us  here   no   two  events  are  identical. 
Because  communication  events  are  unique  in  this  way,    we    avoid 
certain  ambiguities  which  might  otherwise  arise  in    some    of  the 
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definitions  below.     Clause    (1-b)  arranges  the  events  in  an  inter- 
view in  an  order  which,  corresponds  to   the    time    order    in   which 
R    examines  the  messages  to  determine  their  values.    We  make 
no  assumption  concerning   whether    the    order    of   examining    the 
messages  corresponds  to  the  order  of  receiving  them,    or    con- 
cerning the  order  of  examination  of  messages  which  are  received 
simultaneously.     By  not  making  any  such  assumptions,    we  allow 
R    a  greater  flexibility;     different   arrangements    constitute   dif- 
ferent "senders"  [see  below]    and  might   determine   different   re- 
wards.    Not  only  is    R    free  to  arrange   an   "actually  given"    set 
of  communication  events  in  whatever  order   he    chooses,   but  he 
can  also  introduce  artificial  communication  events  for  clarity  or 
technical  convenience. 

Clause    (2 -a)   allows    R    to  enlarge  his  knowledge  or  his  ques- 
tion set  during  the  course  of  an  interview.     Thus    R     can   add  to 
k    knowledge  which  he  gains  about  the  world  in  general  and  about 
acts  of    S    in  particular.     But    R    does  not  add  the  messages  them- 
selves to    k;   he  is  motivated  to  keep  the  messages  separate  from 
k    by  his  desire  to  evaluate  the  messages  relative  to    k  .        If  he 
adds    m    to    k,  it  is  because  he  observes  the  world  and   sees   for 
himself  that    m    is  true.     R    is  in  continual  competition  with    S 
to  see  how  much  better    S    is  than    R    as  a  source  supplying   in- 
formation to    R  —  one  motive  for  this  is  that    R    wants    to   pay  S 
as  little  as  possible  for  the  information  which    S    sends   him  — 
but    R    nevertheless  wants  to  keep  an  accurate  record  of  S's  per- 
formance as  an  information  source.     Clauses  (2-b)  and  (2-c)    se- 
cure that  an  interview  is  articulated  clearly;    they  enable    R     to 
keep  a  clear  record  of  his  re -evaluations  of  messages    and  the 
reasons  for  these  re -evaluations. 

Definition  12.4:  A  communication  event  E,*  for  R  at  t„  is 
a  change  of  mind  with  respect  to  a  communication  event  —  E- 
for  R  at  t  if  and  only  if  t_  is  later  than  tf  and  L-  =  L~ 
and   either   "  — 

(1)  k^    does  not    L-imply    k.  , 

(2)  CT    is  not  included  in  Q-7   or 

(3)  If  *    Ij  .  1 

Cases  (1)  to"  (3)  will  be  called,  respectively,  change  of  opin- 
ion, change  of  interest,  and  change  of  set.  The  case  where 
(k_i  &  k-)  is  inconsistent  will  be  called  conversion.  That  is,  E; 
is  a  change  of  opinion,  or  interest,  etc.  ,  for  R_  with  respect" 
to  E^  according  as  E;  is  a  change  of  mind  with  respect  to 
E^    and  clause  (1),     or  clause  (2),  etc.  ,    holds. 

It  should  be  noted  that    'either  ...  or'    in  this  definition  is    in- 
tended to  express  disjunction  in  the  inclusive  sense.      Clause  (1) 
allows    (k-  &  k^)   to  be  consistent,  as  is  the  case  where    k;     is 
formed  by  simply  dropping  part  of  k- .    Clause  (2)  requires    that 
Q-    is  not  simply  an   enlargement   of  ~  Cj .  . 
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Theorem  12.1:  If  the  relation  of  being  later  than  is  irreflex- 
ive  and  asymmetric,  then  change  of  mind  is  irreflexive  and 
asymmetric. 

Definition  12.5:   A  controversy  for    R    is  a  sequence   E  ,  .  .  .  , 
— n  '   wnere    n  >  0  ,   of  communication  events  for    R,  such  that 
^  1 )    ifl^i^j^n,    then 

(a)  the  sixth  member  of    E^    is  identical  with    i,   and 

(b)  if  E-  is  a  communication  event  for  R  at  tg  and 
E^  is"  a  communication  event  for  R  at  tf  and  t„ 
is"  later  than    tf  ,    then    j  >  i  ; 

(2)   if  1  <  i^  n,   then     " 

(a)  ^  =  L.^, 

(b)  if"k.  =fkiA  then    rni  =  m.^,   Q{  =  Q.^,   and  Ii  =  1.^, 

(c)  if  Ql  *  Q"^  then    m7  =  mT^,   kr  =  k."  ,    and    17=  1"^, 

(d)  if  I-~^  I."     then    m~  =  m~    ,   kT  =  k7    ,    and  Q7  =  Cj.     . 
v    '         "I       1-1  — l      —l-i     -1      -i-i  —i      —i-i 

Possibly    R    will  change  his  mind  during  the  course  of  a   con- 
troversy;   in  general,    a  controversy  will  resemble  a  chain  of  r_  +  l 
interviews  linked  by    r_    changes  of  mind.   If  a  controversy  has    r_ 
changes  of  mind,   it  maybe  called  an   (r_+l)-stage  controversy. 
Interviews  would  then  be  one-stage  controversies. 

Controversy  is  the  most  general  type  of  communication  situa- 
tion we  shall  consider  in  this  book.  We  shall  not  consider  com- 
munication situations  in  which  R  changes  his  language.  Inci- 
dentally, just  as  a  single  communication  event  may  count  as  an 
interview,  so  single  communication  events  and  interviews  count 
as  controversies.  We  use  these  terms  in  this  way  both  to  sim- 
plify our  analysis  and  for  heuristic  purposes;  the  reader  should 
always  keep  in  mind  the  possibility  of    R's  changing   his   mind. 

Definition  12.6:    Let    E  ,  .  .  .  ,  E      be  a  controversy.       Then: 

(1)  E-    is  a  growth  of  knowledge  in    E  ,  .  .  .  ,  E^    with  re- 
spect to    E.       if  and  only  if    k-    is   not  L-equivalent  to 
k.       but    k-~    L-implies    k.     .  ~ 

(2)  ET    is  a  growth  of  interest~in    E  ,  .  .  .  ,  En    with  re- 
spect to    E.       if  and  only  if    Q.       is  prope"rly  included 
in    Qi.  i_1  ~ 

(3)  Ej    is  a  growth  of  wonder  at    p    in    E  ,  .  .  .  ,  En    with 
respect  to    E.         if  and  only  if    I-(p)  >  L_,(p)  • 

$       $       $       $       $ 

A  problem  arises  in  connection  with  situations  —  in  controver- 
sies —  where    R    is  able  to  do  some  or  all  of  the  following  simul- 
taneously: 

(a)  receive  the  signals  which  convey    m 

(b)  analyze  the  content  of    m 
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(c)  evaluate    m 

(d)  change    k,    Q,    or    I. 

Consider  a  sender  who  announces  'I  have  a  speech  d-d-defect'. 
In  the  process  of  receiving  and  examining  the  message  here,  R 
comes  to  know  that  what  it  says  is  true.  In  order  to  evaluate  m, 
R  must  determine  how  informative  m  is  .  But  the  informative- 
ness  of  m  depends  in  part  on  the  content  of  k  .  So  R  must  decide 
whether  k  implies  what  m  says,  or  whether,  although  k  "now" 
implies    m  ,   k    did  not  imply    m  "at  the  time    m    was  received". 

Can  we  stipulate  a  rule  prescribing  one  ordering  of  (a)    to    (d) 
which  will  be  adequate  for  all  situations    of  the    type   in  question 
here?    There  are  many  situations  where  a  general  rule  would  be 
not  only  tolerable  but  definitely  desirable.       There   is   the   tele- 
typist whose  machine  is  old;     if  he    sends    'One   of   -y  keys    is 
-issing'    as  a  separate  message,   is  he  wasting  the  company's  time? 
There  is  the  lookout  on  the  ship;    should  he  announce  things  which 
the  officer  standing  next  to  him  evidently  sees  for  himself? 

Concerning  these  matters  we  make  the  following  assumption. 
This  assumption  does  not  affect  the  developments  below,  so  it  is 
stated  only  informally.  The  point  of  stating  the  assumption  here 
is  to  clarify  the  intended  interpretation  and  application  of  this 
analysis.  The  assumption  is  that  in  cases  where  R  is  able  to 
do  some  or  all  of  the  following  simultaneously,  he  does  them  in 
the  order  indicated: 

(1)  receive  the  signals  which  convey    m 

(2)  change    k 

(3)  change    I 

(4)  change    Q 

(5)  analyze  the  content  of    m 

(6)  evaluate    m  . 

The  priority  of  (1)  over  the  others,  and  of  (5)  over  (6),  is  clearly 
reasonable.     The  priority  of  (2)  over  (4)  is  natural   in  that   the 
members  of    Q    are  normally   L- implied  by    k.      The   priority  of 
(3)  over  (4)  seems  natural  because  of  cases  in  which  a  change  of 
set  might  suggest  new  questions,    and  because  any  change  of   set 
caused  by  asking  a  question  would  have  been   caused  previously 
by  knowing    k  .     The  priority  of  (2)  over  (3)  is  natural  for  any    R 
who  wants  his  change  of  set  to  be  caused  by  change  of  knowledge. 
Giving  (2)  priority  over    (5)   benefits    R    in  the   following   ways: 
Where    m    is  false  and    -m    is    L-implied  by  the  new    k    but  not 
by  the  old    k,    R    can  reject    m    immediately.    Where    m    is  true 
and    m    is    L-implied  by  the  new    k    but  not  by  the  old  k  ,    R   does 
not  have  to  pay    S    for    m  ;  hence    S    will  be  motivated  in  general 
to  speed  up  his  message -sending.   In  practice    S   learns  not  to  send 
m    at    t_,    or  not  to  expect  pay  for    m,    in  cases  where  he  sees  that 
R    can  learn  at    t ,    independently  of    S  ,   that    m    is  true. 
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Giving  (4)  priority  over  (6)  benefits    S    in  that,   normally,   the 
more  questions  there  are  in    Q    the  more  value  there  is   in     m  . 
Giving  (4)  priority  over  (5)  benefits    R    in  that,    although  he   will 
have  to  pay  for  answers  to  questions  which  he  had  in  mind  before 
he  knew  what    m    said,   he  will   not  have   to   pay  for   answers   to 
questions  which  the  content  of    m     stimulated  him  to  ask.       Since 
R    is  free  to  enlarge    Q     whenever   he    chooses,   he   might,    even 
after  he  sees  how  he  might  fruitfully  enlarge    Q  ,   be  tempted   to 
postpone  enlarging     Q,    in   order    to   postpone   having   to   pay     S. 
Nevertheless,    R    ought  not  to  postpone  this  if  he  wants  an  accu- 
rate record  and  measure  of  the  richness  of    S;    he    ought   in  his 
own  bookkeeping  to  enlarge    Q    whenever  he  is  cognitively  stim- 
ulated to  do  so.     He  is    still   free   to   try  to   deceive    S    about  the 
actual  content  and  growth  of    Q  ,    in  order  to  deceive    S    about  the 
pay    S    ought  to  receive.     Of  course,    S    can  defend  himself  in  the 
long  run  by  cross-examining  or  boycotting    R  . 

The  ordering  (1)  to  (6)    given   above   has   normative   import  in 
that  it  could  be  prescribed  as  a  rule  for    R    to  follow.  It  has  em- 
pirical import  in  that  it  seems  possible  to  test  actual  behavior  in 
this  respect.     It  might  be  discovered  that  people    show  no   regu- 
larity in  this  respect  at  all.      What  would  show  that  our  assump- 
tion is  unrealistic  would  not  be   the   discovery  of  no   regularity, 
but  the  discovery   of   some    regularity   other    than   our    (1)    to    (6). 
Our  point  here  is  not  so  much  to  commit   ourselves   to   one   par- 
ticular ordering,   but  rather  to  show  that  it  is  possible  for    R  to 
adopt  some  rule  or  other. 

rfc     s8c     rfc     ric     ric 

It  might  happen  that  in  a  change  of  mind    R    drops  part  of     k, 
and  in  a  subsequent  interview  resubscribes  to    k,  so  that  the  end 
of  the  controversy  is  not  a  change  of  mind  with  respect  to  the  be- 
ginning.    But  we  can  consider  controversies     E  ,   .   .  .  ,  En  for  R 
such  that    En    is  a  change  of  mind  with  respect   to    E    ;        these 
might  appropriately  be  called  dialectics.        We    shall    do  nothing 
with  this  idea  here,    beyond  noting    the   following   possibilities  of 
analysis:     The  number    of   communication   events    in   a   dialectic 
may  be  taken  as  the  length  of  the  dialectic.  The  number  of  changes 
of  mind,    divided  by  the  length,    might  be  taken  as  a  measure    of 
density.    As  a  measure  of  intensity  we  might  take  the  sum  of  the 
positions  of  the  changes  of  mind  and  divide    it  by   the    length;      a 
dialectic  is  then  more  intense  if  its  changes  of  mind  occur  nearer 
the  end.       Further  refinements    of  these    concepts   become   pos- 
sible when  we  consider,    in  addition  to  the  number  and  position 
of  the    changes    of  mind,    the    differences   between,    say,      I(k.) 
and    I(k.)  .  "    - 
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Definition  12.7:    The  sender    S    in  a  controversy    E E 

for    R    is  the  sequence    m    .....  m     . 


— n 


It   might  be    objected   that    such   an    S    is    too   abstract,    in   that 
no  restrictions  are  made  on  the  selection  of  the   messages  which 
constitute    S  ,    whereas    in   practice    a   receiver    is    willing  to  at- 
tend  to  only  a  few  sequences  of  messages   and  these   are    chosen 
according    to    flesh-and-blood   considerations    in   the    case    of   in- 
dividual human  senders  or  somewhat   more    complex   considera- 
tions   in   cases    where    the    sender    is    complex,     like    a   business 
corporation.       In  reply  it  can  be  argued  that  identifying  a  sender 
with  his    messages    is    one  of  the  marks    of    a   rational    receiver; 
the    gist   of   one    such   argument   is    that   if  you   want   to   build   a 
thinking  machine  you  build  it  to  receive  and  send  messages;    you 
need   not  build   it   to    enjoy    strawberries    and    cream   or    enter 
beauty  contests.1     Furthermore,    identifying    senders  with   mes- 
sages   is    a   useful   technical    device  for  a  receiver  who  is  trying 
to  infer  from  a  given  set  of  messages  to  a  flesh-and-blood  sender 
who  is  unknown.     Such  is  the    case  of  a  scholar  who  is  trying    to 
infer  from  a  set  of  fragments    to   an   author    of  those    fragments. 
The   fragments   are   not   given   in  any  one   order,    so  the    scholar 
is   free   to   arrange    them   in   any  order  he    chooses;     each   order 
can  be   regarded  as    a   different  sender,    and  the    scholar's  prob- 
lem then  is   to   discover    a    sender  — that   is,    ordering  — having 
certain    characteristics.        Despite    our    formal   reduction   of     S, 
however,    -we    shall   for   heuristic   purposes    continue    to  speak  of 
S     as    though   he    were    a   flesh-and-blood  human  being   who   has 
abilities    and   interests    and   for    various    reasons    sends    the    se- 
quence   m  ,   .   .   .  ,  m 

sje     ije     ijc     $     9jc 

So  far,  we  have  concentrated  on  communication  situations  in- 
volving one  R.  Using  the  same  techniques  of  analysis,  we  can 
also  investigate  communication  situations  for  groups  of  R's;  we 
can  analyze,  for  example,  the  notion  of  tradition.  This  concept 
will  be  of  direct  interest  to  us  later,  in  connection  with  the  theory 
of  meaning. 

We  shall  here  construct  a  number  of  tradition  situations    in   a 
way  which  parallels  our  construction  of  communication  situations 
for  the  individual    R  .     Corresponding  to  communication  event  we 
define  tradition  event;    tradition  events  can  then  be   used   as    our 
building  blocks.     Corresponding  to  interviews  we  construct  tra- 
dition epochs;    corresponding    to    changes    of  mind   we    construct 
transitions;    and    corresponding    to    controversies    we    construct 
histories.     The  idea  which  is  basic    to   both   our    method   of   con- 

1  Turing   [1950]. 
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struction  and  the  interpretation  of  our  constructs  is  this  :         We 
construct  a  tradition  for  a  set  of  people  indirectly,  by  construct- 
ing a  set  of  artificial  communication   events;     these    events    are 
constructed  by  abstracting  certain  elements  from  communication 
events  in  which  the  people  in  question  have  been   involved  in  the 
appropriate  way. 

Definition  12.8:   A  tradition  event    Z    at  time    t    for  a  set  of 
communication   events    E    ,  .  .  .  ,  Er  ,   where    r_  >  0   and  each 
of    E    ,     .  .  .  ,    Er     is   a   communication  event  for  some   R   at 
t  ,    is  an  ordered~sextuple    <  m    ,    L    ,k    ,Q    ,1    ,i>  such 

Z  Z  Z        — »Z         "Z  z 

that  ~         —       —       — 

( 1 )  if   1  <  i  <:  r_ ,   then 

(a)  L^  =  Lt    and    Iz  =  Ij > 

(b)  !sz    is  consistent  and  is    L-equivalent  to    (k  &.  .  .  &  kr) 

(c)  Qz    is  nonempty  and  is  included  in    Q^; 

(2)  Hlz    i-s  a  statement  of    L    ;    and 

(3)  \~ is  a  positive  integer. 

In  a  tradition  event,   then,    the    Rfs  share  a  common  language, 
common  information-value  judgments,    and   common  interests. 
They  might  know  different  things,  however;    the  requirement  that 
their    Q's   overlap  is  not  sufficient  to  imply  that  their    k' s    over- 
lap in  any  important  sense.     All  we  require  about    k    is   that  the 
R's   be  able  to  pool  their  individual  knowledge  into  a  single   fund 
of    sharable   knowledge,    such  as  an  encyclopedia.       At  least   for 
the  present  we  shall  not  require    mz    to  bear  any  particular  log- 
ical relation  to    m^  .     In  the   next  few   definitions   we    shall  use 
1  rrij1,    'Li',    etc.,    to  refer  to  the  first,    second,    etc.,   members 
of  the  tradition  event    Z^ .    In  speaking  of  a  tradition  event  _Z   we 
may  omit  reference  to  the  time  or  to  the  set  of  communication 
events  for  which    Z    is  a  tradition  event,    leaving   these   param- 
eters understood. 

Definition  12.9:   A  tradition  epoch  for  a  sequence   Y  ,  .  .  .  ,  Yg, 
where     s_  >  0  ,   of  sets  of  communication  events  is  a  sequence 
Z. ,....,  Za     such  that 

(1)  Z      is  a  tradition  event  at    t-     for    Y    and   .  .  .     and    _Zg 
is    a  tradition  event   at     t  •    -1  for    Yg ;  ~" 

(2)  if    1  <  j  <  k^:   s_,    then         "*— 

(a)  the  sixth  member  of    Z_-    is  identical  with  j  ,   and 

(b)  if  Z_k  is  a  tradition  event  at  t  for  Y^  and  Z- 
is  a  "tradition  event  at  tr  for  —  Y^  and  t„  is  ~ 
later  than  _tr,    then    k  >  j~;    and  ~ 

(3)  if  1  <  i  ^   s_,   then 

(a)    L.  =  L.     ,    I.  =  I.     ,   k.   L-implies    k.     ,    and    Q.      is 


included  in     Q. 


(b)   if    kj  ^  k^,   then    mi  =  m^    and    ^  =  Q.^ 


and 
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(c)    if    Q.  *  Q^j,    then    rn^  =  m^    and    k .  =  k.^. 

Definition  12.10:  A  tradition  event  Z-  at  tg  is  a  transition 
with  respect  to  a  tradition  event  Z,-  "at  tr~ if  and  only  if  t 
is  later  than    tr    and    L^  =  L-    and    either 

(1)  k-    does   not    L  -  imply  ~k^  , 

(2)  Qj    is  not  included  in    Q-  ,    or 

(3)  Ij"*  Ii; 

Case  (1),   where  clause  (1)  holds,   will  be  called   enlighten- 
ment,    case  (2)  change  of  direction,   and  case  (3)  change   of 
presumption;    the  case  where     (k^  &  k^)    is  inconsistent   is 
revolution. 

Definition  12.11 :   A  history  is  a  sequence     Z_x,  .  .  .  ,  Zn  , 
where    n  >  0  ,   of  tradition  events  such  that 

(1)  ifl^i^j^nj    then 

(a)  the  sixth  member  of    Z^   is  identical  with    i  ,    and 

(b)  if    Zi    is  a  tradition  event  at    t„    for    Y:   and    Z_^ 
is  a  tradition  event  at    t,   for    Y^   and    t      is   later 
than    tr,    then    j  >i; 

(2)  if  1  <  i^  n"    then 

(a)  1^  =  L^;    and 

(b)  if"k.  *  k^    then    n^  =  m^,    Qi  =  Q^,   and  ^  =  ^ 
and  analogously  for    Q    a"hd  1_,  as  in  Definition  1275. 

In  these  definitions  we  make  no   requirement   concerning   how 
the    R's,   for  whom  the  initial    E's    are    communication   events, 
are  related  to  each  other.     Intuitively,   we   think  that     R's     can 
share  a  tradition  only  if  they  communicate   with   one   another    in 
some  way  and  to  some  extent.        But  some   traditions   are   more 
loosely  knit  than  others,    and  we  want  our  definitions  to  apply  to 
the  loosest  as  well  as  the  tightest  knit,   and  until  we   have    dis- 
covered empirically  what  the  degree  of  minimal  knit  is  it  seems 
pointless  to  stipulate  in  our  definitions  any  requirements  of  knit 
at  all.     What  we  can  do,   as  a  further  development,    is  distinguish 
within  the  class  of  traditions  various    subtypes    according   to   the 
degrees  of  community  and  continuity  among  the    R's.    The  mini- 
mal degree  of  continuity  is  imposed  on  a  history  by  requiring  that, 
in  each  set  of    E's,    some    E     be   a   communication   event  for    an 
R     such  that  some     E    in   a    subsequent    set   is    a    communication 
event  for  that  same    R  .     A  more  interesting  approach   is   to   use 
quantitative  concepts;    the  following  is  a  simple  example. 

Definition  12.12:    Let     Z     be  a  tradition  event  for    E  ,   .  .   .  ,  E-, 
where  these  are  communication  events  for    R's    composing 
the  set    R    .    Let    Zz    be  a  tradition  event  for    E.^.  ,   .  .  .  ,  Er  , 
where  these  are  communication  events  for     R's  "composing 
the  set    R*       Let    x    be  the  number  of    R's     in  the  inter  sec- 
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tion  of  R*  and  R* ,  and  let  y  be  the  number  of  R's  in 
the  union  of  R*  and  R*.  Then,  the  degree  of  community 
of    Z,      and     Z.      is  the  ratio  of    x    to    y . 

Thus  the  degrees  of  community  range  from   0  to  1 ,    inclusive. 
Two  tradition  events  enjoy  full  community  in  case  the    R's  cor- 
responding to  the  first  event  are  identical  with  the    R's  corre- 
sponding to  the  second. 

Definition  12.13:  A  history  is  communal  if  and  only  if,  for 
every  pair  of  adjacent  tradition  events  _Z  and  rLz  in  the 
history,    the  degree  of  community  of    Z      and     Z_    >  0  . 

Definition  12.14:    The  degree  of  communality  of  a  communal 
history  is  the  average  degree  of  community  of  pairs  of  ad- 
jacent tradition  events  in  the  history. 

Degree  of  communality  is  always  greater  than  zero,  but  never 
greater  than  one.     Because  communality  is   based  on  the    com- 
munity of  adjacent  tradition  events,    a  long  history  can  suffer   a 
high  turnover  of    R's    and  yet  enjoy  a  high  degree  of  communality. 


13.   MESSAGE-PROCESSING 


In  the  remaining  sections  of  this  book  we  shall  consider  the 
situation  of  a  rational  R  who  finds  himself  in  a  controversy,  in 
the  sense  of  Definition  12.5.  The  questions  we  want  to  consider 
are  chiefly  these:  How  can  R  apply  functions  like  I  to  the  se- 
quence of  messages  in  a  controversy?  What  is  a  rational  way  to 
keep  a  record  of  the  results  and  to  judge  the  quality  of  S  from 
this  record? 

Our  objective  in  this  section  is  to  show  how  a  sequence  of  mes- 
sages can  be  processed  —  that  is,    prepared    so    that   evaluation 
functions  can   be  applied   to  it.      That  some  message-processing 
procedure    is    needed  can   be  seen   from  the    horse-race  example. 
An  inside  dopester  gives    R    the  following    tip:     'In  the    fifth  race 
Fleetfoot   will  win,    and   in   the  sixth  race    Soreheel  will  both  win 
and  not  win.'       Let  us  refer  to  this  sentence,  or  to  its  analogue  in 
L,   by  'p'.     By  the  assumptions  of   Section   11,    since    p    is  self- 
contradictory,    I(p)  =  oo    and   V(p)  =  0  .   Also  we  may  suppose  that 
P(p)  =  0  .   According  to  one  theory  of  utility  expectation  and  mes- 
sage value,  the  gain  which  a  bettor  may  expect  if  he  acts  accord- 
ing to  this  prediction  is    a  negative    gain,    namely   the  amount   of 
his  bet.1       Now    R     cannot  pay  an  infinite  amount   of  cash,    so   he 
cannot  pay  for  messages  solely  on  the  basis  of  their  information 
values;    at  the  very  least,    R    wants  a  rule    exempting   him  from 
payment  in  cases   where  the    I    value  is  infinite.       Moreover,    it 
seems  unreasonable   that  the    V    value  and  expected  gain  be  zero 
or  negative;     they  might  be  so    for  the    tip  on   Soreheel,    but   they 
need  not  be  so   for  the    tip  on   Fleetfoot.       Surely  then,    R    wants 
a  rule  telling  him  whether  to  reject  the  entire  prediction  or  just 
the  part  about   Soreheel,    and  whether    to  reject   the    whole  tip    on 
Soreheel  or  just  the    "will  win"    or  just  the    "will  not  win". 

It  might  be   objected  that  the   Soreheel  example  is   unrealistic, 
and  hence  trivial,  on  the  ground  that  in  actual  practice  we  receive 
contradictory  messages  only  rarely,  and  when  we  do  we  immedi- 
ately cross-examine  the  sender  until  he  reneges  on  the  contra-  • 
diction.    The  reply  is  that  there  are  many  important  situations  in 
which  a  rational  receiver  not  only  passively  receives  contradic- 
tions,  but  deliberately  tries  to  provoke  them.    These    situations 
include  courtroom   cross-examination,    testing  a    set  of  writings 
for  inconsistency,   and  testing  a  person  for  competence,   loyalty, 

1  Gabor    [1952],   p.  3. 
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etc.;  in  some  of  these  situations  a  cross-examination  is  not  pos- 
sible, and  in  others  it  is  important  to  analyze  the  structure  of  the 
cross-examination  process  and  the  content  of  the  sender's  replies. 

Actual  communication  exhibits  many  different  types  of  contra- 
diction situation.       In  one,   the   message   is   blatantly  of  the  form 
(p  &  -p)  ;    in  others,  it  is  not  of  this  form  but  is  logically  equiva- 
lent to    (p  &  -p)  .       In  another  type  of   situation    -p     is    received 
after    p    has  been  received.       Sometimes  a  contradiction  is    re- 
tracted and  corrected  immediately  by  the  sender;    sometimes  an 
apparent  contradiction  is  construed  by    R    as  a  slip  of  the  tongue 
and  corrected  by    R    accordingly.     Sometimes  an  immediate   and 
free  cross-examination   is   possible;     usually     R     must   decide 
whether  the  cost  of  cross-examination  is  worth  the  amount  of  us- 
able information  to  be  gained  by  it.     In  some  contradiction  situa- 
tions   S    is  ignorant  and  innocent.     Sometimes  he   is   knowledge- 
able and  malicious;    perhaps  he  even  intends  to  use  the    "anything 
goes"    theorem,    [(p  &  -p)  ->  F] . 

In  our  analysis   below  we   disregard   contradictions   which  are 
considered  by    R    to  be  the  result  of  a  mere  slip  of  the  tongue  or 
other    "noise"   and  which    R    himself  corrects;    the   message,   in 
other  words,    is  the  end-product  of    R' s  receiving,    decoding,  and 
construing  process.     But  we  shall  take  seriously  the    "real"  con- 
tradictions, and  we  shall  treat  them  all  alike.   More  precisely,  we 
want  a  record-keeping  device  which  treats  all  contradictions    as 
data  of  the  same  sort;    if    R    wants  to  discriminate  among  contra- 
dictions,   he  does  so  not  at  the  start  of  his  record-keeping  but  at 
the  end,  when  he  can  study  the  behavior  of    S    over  a  long   period. 

Included  among    R1  s  wants,   which  we  must  now  try  to   accom- 
modate,   are  the  following :    R    wants  true  answers  to  all  his  ques- 
tions,   as  quickly  and  cheaply  as  possible.     He  agrees   to  pay     S 
according  to  the  value  of    S's  messages.     But  he  wants  to  be  able 
to  delay  actual  payment  until  he  has   had  time   to    cross-examine 
S  ,   verify  his  assertions,    analyze    S's  behavior,    etc.      He   wants 
a  scoring  system  whereby  he    can   assign   scores   to   messages, 
both  to  determine  rewards  and  penalties  and  to  construct  graphs 
of    S's  information  behavior  over  any  period  of  time.       He   wants 
a  scoring  system  such  that 

(1)  a   contradiction  is   accorded   semantic   value  but  not  pay 
value, 

(2)  a  contradiction  produced  by  one  message  is  not   allowed 
to  influence  the  analysis  of  messages  arriving  later, 

(3)  where  he  can  analyze  a  contradictory  result  into  two  parts, 
one  defective  and  the  other  not,  he  may  penalize  and  iso- 
late the  defective  part  but  reward  and  utilize  the  nonde- 
fective  part,    and 

(4)  falsehoods  and  contradictions  are  forced  to  appear  rela- 
tively early,   if  at  all,    in  the  message  sequence. 
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There  are  many  possible  procedures  for  handling  contradictions, 
depending  on  what  R  wishes  to  salvage  from  the  wreckage  pro- 
duced by  a  contradiction.  The  following  definitions  are  based  on 
the  assumption  that  if  R  receives  a  message  which  is  consistent 
but  contradicts  earlier  messages,  he  will  retain  the  later  mes- 
sage and  reject  the  earlier  ones.  The  rationality  of  this  policy 
is  discussed  below. 

rfc      ric      rfc      rfc      sic 

Definition  13.1 :    The  usable  message   precedent,    umpj    of 
E^    in  a  controversy    E  ,   .  .  .  ,  En  ,    is  the  statement  formed 
iri"  the  following  way: 

To    (p  v  -p)  is  conjoined    m.   ,   where    h    is  the  greatest 
number  less  than    i     such  that    (m-r    &  k^)     is    consistent, 
and  to  the  conjunction  thus  formed- mg  "is    conjoined, 
where    g    is  the  greatest  number  less- than    h    such  that 
(S2h  8c  rcu  &  kjj    is  consistent, 
and  so  oh,  back  to  the  beginning  of  the  message  sequence. 

Here    p    can  be  some  arbitrarily  chosen  statement.  The  point 
of  including     (p  v  -p)     in   umpj    is  to  secure  that  ump^    always  ex- 
ists.    If    i  =  1,    or  if  none  of    m  .     .  .  .  ,    m.       is  consistent  with 
k.  ,   then  ump.    is    (p  v  -p). 

Theorem  13.1 :   In  a  controversy    E,,  .  .  .  ,  E    ,    (ump-  &  k^) 
is  never    L-false,   but    (m-  &  ump^)  may  be  ]T- false. 

Definition  13.2:    The  semantic  value  of    m^    in    E-     in  a  con- 
troversy   E  ,   .  .  .  ,  En    is    I^(m-  |  ump- )  . 

Theorem  13.2:   In  a  controversy    E  ,  .  .  .  ,  En  ,  the  semantic 
value  of    m^    in    E^    is    (1)    zero  in  case  ump^   L-implies    m^, 
and  (2)  infinite  in  case    (m^  &  ump-)    is  L-faTse. 
Proof:   For  (1),    see  Theorem  11.57   For  (2):   I(ump-)  is  never 
infinite,  but    I(m^  &  umpj)     is  if    (m^  &  umpj)     is    L-false. 

The  above  two  theorems  show  that    semantic   value    cannot  be 
used  as  a  reward  or  payment   function,   but  might  be   useful  in 
keeping  a  record  of    S!s  behavior.   A  graph  showing  the  semantic 
value  of  messages  will  show  the  amount  of  "news"    (relative   to 
previous  messages)  brought  by  each  message.         The  graph  will 
register  an  infinite  value  for  a  contradiction  but  will  then  quaran- 
tine the  contradiction.     The  ump-   concept  is  perhaps  most  useful 
to    R    in  connection  with  the  following  definition. 

Definition  13.3:    q    is  a  test  question  in  a  communication 
event    E-     in  a  controversy    E  ,  .  .  .  ,  E       if  and  only  if 

(1)  q~is  in    Qj,  ~ 

(2)  q    is  L-implied  by  ump-,    and 

(3)  some    d    to    q    is    L-implied  by    k^. 
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The  purpose  of  the  second  clause  here  is  to  secure  that   if     R 
actually  asks    q  ,   he   is   not  thereby  giving    S    any  information. 
The  definition  of   'test  question1    which    R    will  need  in  general  is 
not  Definition  13.3  but  a  refinement,    Definition  13.8,    given  below. 
The  basic  concept  of  our  system  of  message-processing  is  given 
by  the  following  definition: 

Definition  13.4:    The  usable  message  total,    umt^  ,    of    E^    in 
a  controversy    E  ,  .  .  .  ,  E    ,    is  the  statement  formed  in  the 
following  way: 

To    (p  v  -p)    is  conjoined    m.    ,   where    h    is   the   greatest 
number  less  than  or  equal  to-  i    such  that     (m.    &  k^)     is 
consistent,    and  to    the    conjunction   thus    formed    m~      is 
conjoined,    where     g     is  the  greatest  number  less  than    h 
such  that    (m^  &  m„  &  k-)     is  consistent, 
and  so  on,  back   to~T:he  beginning  of  the  message  sequence. 

Theorem  13.3:    In  a  controversy    E  ,   .  .  .  ,  En,   (umt^  &  k.) 
is  never    L_- false,   but    (m-  &  umt^)    may  be    L- false. 

The  chief  difference  between    ump-    and  umt'    is  that    m^    can 
be  in  umt.  but  cannot  be  in  ump^  unless   m-  =  m      for  some   g  <  i  . 

Theorem  13.4:   In  a  controversy    E^,  .  .  .  ,  En,    if    (k^  &  m^ 

&  umt.    )  is  consistent  and    k^  is  L-equivalent  to    k.  ~     the"h 

umt>  is  formed  by  conjoining  ~(p  v  -p),    m^,  and  just"   those 

messages  which  were  in  umt. 

Proof:    Because     (k  •  &  m^)    is"  consistent,    m^    is  in   umt^. 

Next  we  proceed  by~induction  on  the    "distance"    from    m-  : 

Base:    If    m.  ,     is  in  umt-  ,,    then  it  is  in  umt.: ,  because  if 

— 1-1  i-i  1. 

it  were  not,~Th.en    (k^  &  m-  &  m.     )    would  be    inconsistent, 

making     (k-  &  m-  &  umt.    )~  inconsistent.     If    m.  ,    is  not  in 
°     x— l       — l  i-i'  — i-i 

umt.     ,     then     (k-  ,    &  m-  .)     is  inconsistent,  making     (k^ 
&  m".   &  m-  .)     inconsistent,    so  that    m.       is  not  in  umt^  . 
Step"    We  "assume   that   every  message   from     m-        to- 
m-    .     is   such  that  it  is  in  um.t-     just  in  case  it  is  in  umt.    . 
Now  consider    m.   ,.,   ,  .     Suppose  it  is  in  umt.  ,    but  not  in 

umt^.      Then  it  is  inconsistent  with    (k-  &  m^)     and   those 
messages  from    m.        to    m-  •    which  are   in  umtj.  Now  the 
latter  are  all  in  umt     ,    so    rri-  *. .   .     is  inconsistent  with 
(k.  fk  m-  &  umt.     ),    and  hence     (k^  &  m^  &  umt.  j)  is  incon- 
sistent also.    Next  suppose  that    m.   ...    .     is    not   in  umt.     . 

Then  it  is  inconsistent  with    k.       ancT  those  messages  from 

—i-i  ° 

m.       to    m.   .     which   are    in    umt-  , .    The  latter  are  in  umt-, 
— i-i  — i-i  Iri  1 

so    m.   ,.  ,    .      cannot  be  in   umt.. 
— lr(j_+i)  i- 

It  might  be  objected  that  the    "policy   of  rejection"   formalized 
by  our  definitions  of   'ump'    and    'umt'    is  unrealistic,  on  the  ground 
that  in  real  life  when    R    receives    a  message   which  produces   a 
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contradiction  he  does  not  employ  a  routine  of  message  rejection 
but  merely  makes  an  ad  hoc  analysis  to  determine  what  noncon- 
tradictory  message-selection  S  would  probably  subscribe  to  if 
he  were  challenged.  The  first  reply  is  that  sometimes  there  is 
no  good  basis  on  which  to  judge  what  S  would  probably  do,  and 
in  other  cases  the  reasonable  probability  judgment  which  R 
makes  happens  to  be  wrong,  and  perhaps  even  because  S  delib- 
erately plans  it  this  way. 

The  second  reply  is  that,    regardless    of  actual  practice,    the 
policy  embodied  in  these  definitions  is  a  rational  one    for    R     to 
use.     It  favors     S    in  that  it  allows     S    without  penalty  to    correct 
slips  of  the  tongue,    to  learn  from  experience,    to  report  a  change 
in  the  world,    or  simply  to  change  his  mind.       All   this    should  be 
acceptable  to    R    so  long  as    S    is  not  growing  senile.         It  posi- 
tively favors     R    in  the  important  cases  where    S    tries  to  deceive 
R    because  he  wants  to  persuade    R    and  the  deceit  is   necessary 
as  a  means  to  the  persuasion.     Of  course,    S    wants    R     to   re- 
ceive and   "register"   the  persuasive  aspects  of  his  message  se- 
quence,  but  not  the  logically  defective  aspects.     Since    R    wants 
whatever  usable  information  he  can  get  from    S  ,   he  is  willing  to 
be  persuaded,    so  long  as  the  information  and  persuasion  are  not 
defective.      Though    R    must  be  wary,   he  cannot  afford  to  put  ex- 
cessive energy  into  being  a  perfect  logician.       Unfortunately  for 
both    R    and    S  ,    information  and  persuasion  can  usually  be   ex- 
tracted from  a  message  only  by  a  process  which  also    considers 
the  logical  structure  of  the  message.     This  means  that  S   usually 
cannot  construct  messages  with  the  persuasiveness   on  the  surface 
and  the  logical  defects  hidden  inside.  It  also  means  that  R   some- 
times cannot  tell,   by  a  mere  rough  calculation  of  expected  profit 
or  loss,   whether  or  not  he  should  be  a  perfect  logician  and  dig  to 
the  interior  of  the  message.     The  policy  of    S,    then,    is   to    con- 
struct a  long  message  sequence,    such  that    R    will   receive   and 
be  persuaded  by  certain  individual  messages,    and  these  individ- 
ual messages  will  be  defective  —  if  at  all  — ■  only  in  the    context 
of    k    and  the  message  sequence  as  a  whole.     If,    as  is  the  usual 
situation,     S     does  not  know  what    k    is,    and  when  or  how  often  R 
will  be  a  perfect  logician,    his  only  hope    for    deceiving    R     is    to 
divide  his  defective  message   into    smaller   messages,    perhaps 
defective  in  conjunction  but  not  defective  when  taken  singly,    and 
scatter  these  smaller  messages  widely  over   the   total   message 
sequence.     This  scattering  is  justified  by  the  assumption  that   a 
defect  is  less  likely  to  be  detected  in  a  long  sequence   than  in  a 
short  one.     On  the  other  hand,    S     cannot  make  his  sequence    too 
long,   because    R's  patience  might  be  limited.     Here    S     compro- 
mises,   and  arranges  his  messages  in  an  order  of  priority;         he 
puts  the  most  urgent  message  close  to   the   beginning    of  the    se- 
quence.    Now  suppose  that    R    announces  that  he  intends  to  proc- 
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ess  messages  via  Definitions   13.1  and  13.4.    This  announcement 
forces    S    to  choose  between  equally  unhappy  alternatives:  First, 
he  might  place  the  most  urgent  part  of  his   message    set   early 
enough  in  the  sequence  so  that    R    will  be  sure  to  receive  it,   but 
later  than  the  other  parts  of  the  sequence  with  which  it  conflicts, 
so  that  it  will  not  be  rejected  via    R's  policy.         Or    second,   he 
might  place  the  conflicting  messages  so  far  apart  that    R  will  — 
hopefully  —  not  discover  any  defects  until  after    S    has  persuaded 
R    and    R    has  acted  accordingly.       If    S    takes   the   first   course, 
all  of  the  conflicting  parts  of  the  sequence  will  have  to  be  placed 
relatively  close  to  the  beginning  of  the    sequence,  and  it  will  be 
easier  for    R    to  detect  the  trouble;    if  he  takes  the  second,  he  is 
forced  to  the  expense  of  a  long  sequence,    and    R    will  probably 
become  suspicious  and  decide  to  be  a  perfect  logician. 

Is  our  policy  applicable  in  the  case  where    S    wants    to   do   his 
own  censoring?    This  is  the  case  where    R    challenges  or  cross- 
examines    S    and    S    acknowledges  his    inconsistency  and  blurts 
out   'But  I  didn't  mean  that;    I  take  it  back'.        In  our   model,      R 
can  allow  this.     He  can  start  afresh  with  the  censored  message 
sequence,    construing  all  of  its  parts  as  messages   which  arrive 
simultaneously  at  the  time  of    S's    revision,  and  he  can  also  con- 
tinue to  keep  track  of  the  original,   unexpurgated  sequence.       He 
regards  each  sequence  as  a  different  sender.     When    "pay  day" 
arrives,  the  pay  value  of  one  might  be   greater   than  that   of  the 
other,   but,   because    k,    Q,    and    I    can  change  with  time,  the  ad- 
vantage might  not  lie  with  the  censored  sequence.  The  technique 
of  keeping  track  of  both  expurgated  and  unexpurgated  message 
sequences  is  a  familiar  one  since  Freud;    if    S    makes  a   "Freud- 
ian slip,"   he  corrects  it, and    R    pretends  to  pay  attention  only  to 
the  corrected  sequence,   while  inwardly  analyzing  the  content   of 
the  uncorrected  sequence. 

There  are  many  logically  possible  alternatives  to  our  policy  of 
rejection;    they  could  be  formalized  as  alternatives  to  Definitions 
13.1  and  13.4.     One  alternative  is  to  take  the    (n/2)       message  — 
or  the    (n  +  l)/2t      message  in  case   there   is    an  odd  number   of 
messages  —  as  a  base  point  and  work  forward  and  backward  from 
it  in  alternation;    this  procedure  corresponds  roughly  to  the  proc- 
ess of  reconstructing  an  ancient  writer's  works,    starting   from 
the  work  of  his    "floruit"   period  and  assimilating  other  materials 
to  it.     Another  alternative  is  to  build  up  the  usable  message  total 
by  a  random  process.     Suggestive  of  still  another  alternative   is 
the  way  some  women  read  novels;    they  dip  into  the  novel  at  ran- 
dom until  they  find  an  interesting  love  scene  and  then  work  for- 
ward and  backward  from  it.       A  principle   of  a  different  type   is 
represented  by  the  rule:    'Whenever  possible,    divide  the  defect- 
ive message  into  two  parts,   one  defective  and  the   other   not,    in 
such  a  way  that  the  nondefective  part  is  salvaged  and  the  informa- 
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tion  loss  due  to  rejecting  the  defective  part  is  minimized.1  For 
example,  if  R  believes  that  -Fa  is  true  and  he  receives  (x)Fx, 
he  could  cut  his  information  loss  by  dividing  (x)Fx  into  Fa  and 
(x)(x  ^  a  ->  Fx)  and  rejecting  only  Fa.  If  R  believes  only  that 
(Ex)  -  Fx  is  true  and  he  receives  (x)Fx  ,  not  much  could  be  sal- 
vaged from    (x)Fx  . 

Not  all  of  the  logically  possible  policies  are  equally  good,  of 
course.  A  policy  which  reverses  the  one  chosen  above,  retain- 
ing the  earlier  messages  and  rejecting  the  later  ones,  invites  S 
to  place  his  most  attractive  message  first,  so  that  it  is  sure  to 
be  received  and  —  unless  it  conflicts  with  k  —  never  rejected. 
A  policy  which  rejects  all  messages  rm  ,  .  .  .  ,  m^,  such  that 
(m-  &.  .  .  &  mi    &  k)    is  inconsistent  is  "too  wastefulT 

-It  seems  worth  investigating  empirically  what  policies  of  re- 
jection people  actually  use.  Meanwhile  we  consider  that  the  pol- 
icy we  have  chosen  here  is  a  realistic  one,  in  the  sense  that  there 
is  some  empirical  evidence  that  people  actually  do  behave  in  this 
way.  For  example,  when  the  small  print  in  a  legal  contract  or 
newspaper  advertisement  or  newspaper  story  conflicts  with  the 
heading  or  the  headline,  we  —  or  at  least  those  of  us  who  remain 
solvent  for  long  —  trust  the  small  print.2 

ik     ik     sk     sjc     sk 

A  concept  which  for  many  purposes  is  the  most  important  con- 
cept of  our  message-processing  procedure  is  given  by  the  following: 

Definition  13.5:    The  final  usable  message  total,   fumt:  ;  ,    of 
E-    with  respect  to    E;    in  a  controversy    E  ,  .  .  .  ,  E^  ,  .  .  .  , 
E7,  .  .  .  ,  En  ,   where  "1  ^  i  <!  j  <:  n,    is  the  alphabetically  first 
conjunction- whose  conjuncts  are    (p  v  -p)    and  just  those  mes- 
sages  common  to  both  umtj   and  umtj. 

In  effect,   fumt-  •    is  the  message  total  which    R    considered  us- 
able  at     E-     and-'  still  considers  usable  at    E,  .     Note  that  fumt.:  , 
— 1  — 'J..  .  -'J 

is    L- implied  by  both  umtj   and  umt:,  and  that  fumt^  ^  is  L-equiv-~ 

alent  to   umt:. 


2  We  consider  that  the  findings  in  Hovland  [1957]  are  inconclusive  for 
our  purposes.     Hovland  concludes  that   "primacy"   does  not  necessarily 
dominate    "recency",   but  that  order  of  presentation  does  seem   to   influ- 
ence opinion  in  subjects  with  relatively  low   "cognitive  need".     Anderson 
[1961]  reports  a  primacy  effect  under  certain  conditions  where  the    sub- 
jects were  instructed  to  form  impressions  of  people.        Julian   Feldman 
has  pointed  out  to   me   that,   where    a    subject's   memory  resources   are 
limited,    a  primacy  or  recency  effect  might  represent  the   influence    of  a 
strategy  of  memorization.       In   effect,   the   model  being  presented  here 
assumes  that    R    is  sufficiently  well  equipped  and  well  organized    so  that 
memory  factors  do  not  influence  message-rejection. 
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Theorem  13.5:   In  a  controversy    E  ,  .  .  .  ,  E_n  ,    if  umtj  is  L- 

equivalent  to   umt.     ,   then  umtj    includes   all- the   messages 

•which  are  in  umt.-    . 
1-1 
Proof:    If  not,    then  some    m    in  umt.      is  not  in  umtj,    so 

(k^  &  umt-  &:  m)    is    L.-false.     But  then  so  too  are    (k"j"  &  umt. 

&  2i)>     QSi"&  urnt-_i)»   and   (JSi  &  umt^),    which  by   Theorem    ~ 

13.3  is  impossible". 

Theorem  13.6:   In  a  controversy    E  ,  .  .  .  ,  E    ,    if  umt^    is 

L-equivalent  to   umt.      and   (m-  &  k^)     is    L_-false,  then  fumt^  n 

is    L-equivalent  to  fumt. 

Proof:   Suppose  there  is~ari~  m    in  fumt-        not   L-implied  by 

fumt-  ,  _.     This    m    is  in  umt;    but  is  ribF  m:  ;    also,  it  is    in 
l-l, n  —  i  — i' 

umL   but  cannot  be  in  umt.  ,  ."Hence    (k.  ,  &r  umt.       &  m)    is 
n  i_i  v— 1-1  i-i         — ' 

inconsistent.       Now    m     is"  Li-implied  by  umtj  a"nd  hence  by 

umt-,,    so    (k-      &  umt-,)    is  inconsistent.   But"  by   Theorem 

13.3-  this  is  impossible.     The  supposition  that  there  is  an  m 

in  fumt-  not   L-implied  by  fumt-  n  can  be  treated  similarly. 

Example- (1):    Let    k    be    (p  v  -p)T  ~m     be    p    ,   where   p     is 

neither    L-true  nor    L-false,    m     be     -p  ,    m     be    p    ,   where 

p      is  consistent  with  both    p      and    -p  ,   and    m.    be    --p    .  Then: 
—3  —i  —i  — 4  £_i 

umtj    is    [(p  v  -p)  &  pj, 

umt2    is    [(p  v  -p)  &  -pj, 

umt3    is    [(p  v  -p)  &  p    &  -p  ], 

umt4    is    [(p  v  -p)  &  --Pj    &  P3   &  Pj], 

fumt    4  is    L-equivalent  to    p  , 

fumt        is    L-equivalent  to    (p  v  -p), 

fumt        is    L-equivalent  to    p    ,   and 

fumt  '     is    L-equivalent  to    (p    &  p   ). 
Thus  fumt,       is    L-implied  by  neither  fumt        nor   fumt    .  . 

Example  (2):   Suppose  that    Q    consists   just  of   (Fleetfoot 
wins)?   and   (Soreheel  wins)?   and  that    k    is    L-true  and  thus 
vacuous.     Suppose  also  that    m      is    'Fleetfoot  wins'.         Now 
consider  three  cases: 

(a)  m      is    'Soreheel  does  and  does  not  win',   where   this 
sentence  is  interpreted  as    L-false. 

(b)  m2    is    'Soreheel  wins',   and    m,   is    'Soreheel   does  not 
win' . 

(c)  m2   is    'Soreheel  does  not  win',  and    m      is    'Soreheel 
wins' . 

In  case    (a)   the  semantic  value  of    m      is  infinite;    likewise 

for    m      in  cases  (b)  and  (c).     In  case  (a)  umt2,   fumt1  2,  and 

fumt2  2  are  all   L-equivalent  to    rrij  .       In   cases    (b)   and   (c) 

umt,  and  fumt        are    L-equivalent  to    (m,  &  m,)  :        fumt,  , 
3  3,3  —      ^  — l        — 3'  1,3 

and  fumt        are    L-equivalent  to    m    .     Suppose  that  R  pays 
S    via    I_,    V,   or  the  like,   paying  not  for  all  the  messages  but 
only  for  the  content  of  umt      or   fumt-  n.     Then    R    pays   for 
'Fleetfoot  wins'    in  all  three"  cases,  for- 'Soreheel  wins'    only 
in  case  (c),  and  for  'Soreheel'    does  not  win'    only  in  case  (b). 
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The  next  two   definitions    give   auxiliary   concepts   which  are 
technically  useful  in  later  developments. 

Definition  13.6:   fumt^'   is 

(1)  (p  v  -p)   in  case    i  =  1  ; 

(2)  (fumt~     &  fumt2  n  &.  .  .  &  fumt.^  r)    in  case     i  >  1. 

Theorem  13.7:   In  a  controversy    E^,  .  .  .  ,  En  ,    (fumtj*   & 
fumtj  n  &  k^)    is  never    L-false. 

Prooi'rThis~is    L-equivalent  to   (fumtj  n  &.  .  .  &  fum^  n  & 
kn),   hence  is    L-implied  by   (umtn  &  k~),  which  by  Theorem 
13~.3  is  not   L-false. 

Definition  13.7:   We  shall  use    'Qij'    to  refer  to  the   logical 
intersection  of    Q^    and    Qj,    and'-'k^j'    to  refer  to    (kj  v  kj). 

Thus    kj  i     is    L-implied  by  both    k^    and    kj  ,     and  every   wff 
which  is    L=implied  by  both    ki    and    k-     is    L=implied  by     k^j  . 
Because  of  this  fact,    k.  .     can~be  regarded  as  the  strongest  '"over- 
lap"   of    k^    and    k .  . 

Theorem  13.8:    In  a  controversy    E^   .  .  .  ,  En,  (k^j  &  fumt—) 
is  never    L-false. 

Proof:  (k-  i  &  fumqj)  is  L-implied  by  (kj_  &  umq),  which 
by  Theorem  13.3  is   never    L-false. 

Definition  13.8:     q    is  a  final  test  question  in    Qi  n    in    E.i    in 
a  controversy    E  ,   .  .  .  ,  E_n    if  and  only  if 

(1)  q    is  in    Q^n 

(2)  q    is  L-impTied  by  umpj,    and 

(3)  some  direct  answer  to  ~q    is  L-implied  by    k^  n 

Because  a  test  question  must  have  a  direct  answer  which  is  L- 
implied  by    k^,    and    k^    is  consistent,    it  follows  that   every  test 
question  has  some  consistent  direct  answer.       This  allows  every 
test  question  to  have  partial  answers.     Among  his  test  questions 
R    cannot  include  any  trap  questions  all  of  whose  direct  answers 
are  inconsistent.     In  order  to  provide  for  trap  questions    of  this 
type  we  must  add  further  apparatus  to  our  model.     For  example, 
we  might  add  a  metalanguage   ML  and  then  allow  as  trap  questions 
those  questions     q    such  that  in   ML   it  is    "ML-true"    that    q    has 
no  consistent  direct  answers.     We  shall  not  pursue  these  matters 
further  in  this  book. 

Next  we  present  two  theorems  which  will  be  useful  in  the  proofs 
of  some  of    the  theorems  to  be  given  later. 

Theorem  13.9:  In  a  controversy  E^,  .  .  .  ,  En,  if  (mi  &  k^) 
is  L-false  and  k-  is  L-equivalent  to  k-  ,  "then  fumt^  L- 
implies  fumt^  n. 

Proof:       Since- (m.  &  k^)    is    L-false,    m^    is  not  in  urm^  . 
Then,    since    k^    is"  L.-e~quivalent  to    k^,"  umt^    has  as    con- 
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juncts  just  those  which  were  already  in  umt-     .  So,  by- 

Theorem  13.6,    fumt^  n  is    L-equivalent  to   fumt.         ,   which 
is  a  conjunct  of  fumt.*.  ~     — 

Theorem  13.10:    In  a  controversy    EL,  .  .  .  ,  En,    if    k^    is  L- 

equivalent  to    k-        and    (k-  &  umt.     )    L-implies    m-  ,~  then 

(m.  &  fumt-         )~  L-implies    fumt--   . 

Proof:   By  "Theorem  13.3,    (k-      &  umt     )   is  consistent.   Now 

m-i    is    L- implied  by    (k^  &  umt.     )     arid  hence  by   (k.      &  umt'J, 

so"   (k.      &  m.  &  umt.     )     is  consistent.     Hence,    (k^  &  m^)     is " 

consistent,   a"hd  thus    m^    is  in  umt-.  Now  suppose ~(m^ 

&  fumt.         )     does  not  L-imply  fumt-  „.     Then  there  is  an   m 
l-l, n'  —  J  i»n  - 

in  fum£        which  is  neither    m-     nor  a  conjunct  of  fumt. 

X  t XX  X  X™X  » |LX 

Since    m    is  in   fumt^  n  it  is  in  umt     and  umtj,   but    since   it 

is  not  in  fumt.  ,        it  is  not  in   umt.  ,.       Hence,   by   Theorem 

i-i. n  1-1  J 

13.4,     (k.  &  m.  k  umt.  j)    is  L-false,    and  so    [k.  &  (k.  &umy 

&  umt^_J~  is  L-false   also,   making    (k.      &  umt."  )    L-false 

too.     But  by  Theorem  13.3  this  is  impossible. 

Finally,    let  us  examine  a  particular  controversy  which  has   a 
rather  interesting  property.     Let  us  consider  the  controversy 
E,,  .  .  .  ,  E,,   where 

— 1  — o 

(1)  k  is  p  and  rn,1  is  -p; 

(2)  k  is  p  and  m2  is  q; 

(3)  k  is  p  and  m3  is  (q  -*■  -r_); 

(4)  k  is  p  and  m4  is  r_; 

(5)  k  is  (p  v  -p)  and  m5  is  (r  ->  p); 

(6)  k.  is  (p  v  -p)  and  m6  is  -r_. 

In  this  controversy,   umt      contains  no  messages. 

umt     has  just    q. 

umt3    has  just    (q  -*■  -r_)    and    q  . 

umt4    has  just    r_    and    (q  ->  -r.). 

umt5    has  just    (r_  -»  p),    r_,   and    (q  ->  -r_). 

umt6    has  just    -r_,    (r.  -*•  p),    (q  -*•  -r_),   q,    and  -p. 

fumt,  ,    has  no  messages  in  it. 

1,6 

fumt    6  has  just  q. 

fumt    6  has  just  (q  ->  -r.)    and    q. 

fumt.  ,  has  just  (q  -*■  -r.). 

fumt  '6  has  just  (r.  ->  p)    and    (q  -*■  -r_). 

fumt6  6  has  just  -r_,    (r_-»p),    (q  ->  -r.),   q,   and    -p. 

Now,    the  interesting  property  of  this  controversy  is  this;  Al- 
though   k6    is  L-equivalent  to    k      and    m6    is  L-implied  by    (k6 
&  fumt*),    still,    fumt6  6    might  not  be  L-implied  by   (m6  &  fumt^). 
The  reason,   of  course,    is  that    m1    is    suddenly   "released"   for 
umt,    by  m6  .     The  important  point  here,    as  will  become  clearer 
in  Section  14,  is  that,  even  though    m6    is    L-implied  by  (k6  &  fumt^ ), 
m,     can  still  bring   "news". 


14.    NEWS-VALUE 


What  we  have  been  working  toward  thus  far  is  a  system  where- 
by   R     can  keep  a  record   of    S's     communicative  behavior  over  a 
period  of  time.     We  have  been  aiming  specifically  at  a  system  in 
which    R    can    construct   graphs    which   plot   quantitative    data   of 
various  kinds  against  message  number,    time,    or    the   like.     The 
quantitative  data  might  include  values  of  primary  evaluation  func- 
tions —  such  as  I  —  applied  to  individual  messages  or  to   umt-  or 
fumt-,    etc.,    or  they  might  be   values  of  functions    derived  from 
the   primary  ones,   the  values    of  these   derived  functions   being 
ratios  or  percentages,  etc.    Because  of  this  diversity  of  possible 
values,    R    will  in  general  need  to  evaluate    S    by  means   not  of  a 
single  graph  but  rather  a  set  of  graphs. 

We  can  take  certain  sets  of  graph  data  and  call  them  perform- 
ances;   we  can  take  certain  sets    of  performances  and  call  them 
styles.       The  general  formal  problem  then  is   to  distinguish  and 
analyze  interesting  types  of  performance  and  style.   The  general 
empirical  problem   is   to    correlate   types   of  performance   and 
style  with  other  properties  of    R,   S,   and    Ex,  .  .  .  ,  En- 

In  order  to  study  performances  we   must,   of   course,    specify 
precisely  the  relevant  data.   Now  it  seems  fairly  clear  that  many 
interesting  kinds  of  performance  involve  data  which  are  not  ana- 
lyzed in  this  book.       One    important   example  of  this  sort   is   the 
data  which  pertain  to    S's    metalinguistic  behavior .      If    R    asks 
S    a  test  question  and  S   replies    'I  don't  know',    R    might  believe 
S    and  want  to  count   this  reply  as  standing  to    S's    credit,    or   he 
might  believe  that    S    is  lying  and  want  to  penalize  him  accord- 
ingly.      Until  we  formalize    ML,  we  cannot  analyze    credits    and 
penalties  of  this    sort,   and  thus   we    cannot   fully  assimilate   the 
theory  of  test  questions  to  our  theory  of  the  evaluation  of    S.      In 
this  book,  then,   we  can  make  only  the  bare  beginnings  of  a  study 
of  sender -evaluation.      Our  discussion  in  the  remaining  sections 
of  the  book  will  concentrate  on  two  examples  of  evaluation:   eval- 
uation of  the  news-value  of  a  sequence  of  messages,   and  evalua- 
tion of  the  adequacy  of  messages  relative  to    R's    interests. 

Definition  14.1:   A  news-value  datum  for    E^    in  a  contro- 
versy   E  ,  .  .  .  ,  E       is  a  value  of 

Iib|(ki}n>  fumti*)], 

where 

(1)   p   is  a  conjunction  of  statements     r_    such  that    r_    is    L- 

8  1 
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implied  by    (k-       &  fumt-      ),  and  if    r_    is  not   L-true 
then    r    is  a  direct  or  partial  answer  to  a  question  in 

Qi,n'    and 
(2)    if    p    is  not   L-true  then    p    is  L- implied  by   DA,  where 

(a)  DA  is  a  direct  answer  combination  of  some  non- 
empty subset  of    Q-       and 

(b)  (DA  &  kn  &  umtn)   Inconsistent. 

Hereafter  we  shall  frequently  omit   'in    E  ,  .  .  .  ,  En'    follow- 
ing   'news -value  datum  for    E-'.  ~~ 

Theorem  14.1:   In  a  controversy    E  ,  .  .  .  ,Enthe  set  of  news- 
value  data  for    Ej    has  a  least  upper  bound. 
Proof:   We  assume  here  the  real-number  axiom:     If  a  set  of 
real  numbers  is  nonempty  and  has  an  upper  bound, then  it  has 
a  least  upper  bound.     In  this  case  the  set  of  news -value  data 
is  nonempty,-  because   even  if  there   are   no   answers     r.    to 
serve  as  conjuncts  of    p,    p    still  exists  as  a  conjunction  of 
L- truths.     Concerning  the  upper  bound:    Let    z_   be 

Ii[p|%,n&  fumti*)L 

where  p  is  as  in  Definition  14.1.  Now  p  is  L-implied  by 
(fumt-       &  k-     ),    so  by  Theorem  11.6  we  have 

z  ^  L_i[(fumtLll  &  kLa)  |  (kLa  &  fumt^)]  , 

and  hence  via  Theorem  11.7, 

z  <:  IJfumt^  |  (kia  &  fumtf)]  . 

The  right-hand  term  of  this  inequality  is  finite  [see  Theorem 
13.7],    and  as  an  upper  bound  for  the  set  of  news -value  data 
it  seems  quite  reasonable. 

Definition  14.2:  The  news -value  of  E  j  in  a  controversy 
E  ,  .  .  .  ,  Ej,  is  the  least  upper  bound  of  the  set  of  news- 
value  data  for    E^. 

Definition  14.3:  The  news -value  of  a  controversy  E  .....  , 
En    is  the  sum  of  the  news-values  of  E  ,     .  .  .  ,    En    in 

Iii>  •  •  •  »  En- 
The  news -value  of  a  communication  event  E^   is  thus  the  amount 
by  which    R's     fund  of  usable  information  is  enhanced  at    K^    by 
m^    and  any  messages  which  were  received  previously  but   were 
no~t  acceptable  previously.     In  Definition  14.2  we  might  have    de- 
fined   'news -value  of    m^    in    E^'.         While    officially   abiding   by 
Definition  14.2  as  it  stands  above,   we  shall  occasionally,    in  our 
informal  discussion,    speak  as  though  it  were    m^    which  had   the 
news -value.     Hereafter  we  shall  frequently  omit    'in    Ex,   .  .  .  ,  En' 
following    'news -value  of    E-:. 
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The  news -value  of    E^    is  computed  by    R    only  after    En    has 
occurred,    the  interval  between    E^    and    En    being  the  time  which 
R_    has  available  for  verifying    S '  s    messages.     In   this    case,    R_ 
uses    Ij    rather  than    In ,   because  he   is   here   assessing   the   os- 
tensible newswor thine ss  of    m,    assuming  that    m    is  true.        If 
R    wants  to  pay  cash  for  messages  but  is  cautious    and  wants   to 
supplement  his  yes-or-no  verification  procedures  with  proba- 
bility considerations,    he  will  pay  cash  not  according  to  our  con- 
cept of  news-value  but  according  to  a  concept  of  "second-order" 
news -value.     This  would  be  like  the  concept  given  by   Definition 
14.2  above  except  that    Ij    would  be  replaced  by    I*,  defined  as 


log 


P(m.) 


where     P(mi)     is  the  probability  that    m^    is  true,  computed  after 
E       has  occurred,    and    P^(m^)     is  the  probability  that    m^     is 
true,    computed  after    k^    has" been  specified  but  before    m^    has 
been  analyzed.1 

Theorem  14. 2:    In  a  controversy    E  ,   .  .  .  ,  En  ,    if   (k^n    & 

fumt  •* )    L-implies     (k-  _  &  fumt-  „),   then  the  news-value    of 

Ei=-0. 

Proof:   Let    z.   be  any  news -value  datum  for    E^.       Then  by 

Theorem  11.6,     z  ^  ^[(k.  n  &  fum^  n)  |  (^       &~fumtj*)];  and 

thus   by   Theorem    11.5,"z.~=  0.  ~  ~~ 

Theorem  14.3:  In  a  controversy  E^  .  .  .  ,  En  ,  if  k^  is  L- 
equivalent  to  k-  and  (m^  &  ki)  is  L-false7then  the  news- 
value  of    E_i  =0" 

Proof:   By  Theorem  13.9,   fumt^.    L-implies   fumt^^-         So 
(k~~fc  fumt^)    L-implies    (k^  ~  &  fumt^J .       The   theorem 
theiTfollows'by  Theorem  14. 2~  "[Incidentally,  a  similar   use 
of  Theorem  14.2  in  several  proofs  below  will  be  left  tacit.] 

The  following  two  theorems    are    intended  to   give    conditions 
under  which    R    does  not  have  to  accord  news -value  to  messages 
which  merely  answer  test  questions. 

Theorem  14.4:    In  a  controversy    E  ,   .  .  .  ,  En  ,    if 

(1)  k-     is    L- equivalent  to    k.     , 

(2)  k^    is  true,    and 

(3)  some  disjunctive  question    q    in    Q;         is  such  that  m^ 
is  a  direct  answer  to    q   and  k-       L-implies  some    di- 
rect  answer     d    to    q  , 

then  the  news -value  of    E^  =    0  . 

Proof:   Since     d    is     L-implied  by    k^    and    k^    is  true,     d    is 

true  also.     We  consider  two  cases: 


'See  Goldman  [1953],    Chaps.  I  and  IX. 
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(a)  Suppose    m-     is  true.     Since    q    is  disjunctive,    m^  =  d 

and    m;      is"  L-implied  by    k.         and  hence  by   (k:  &  umt-  ,). 

— i  —  —i»Il  j,  -  1_1 

Thus,    by  Theorem  13.10,    (m.j  &  furnt^")      L-implies 

fumt.:  _..     Since    k-  „     L-implies    m- ,    7  fumt:*  &  k-      )    L- 
i,n  — i,n    —        r  — 1»     *  1         — i>H    — 

implies    (fumt^  n  8t  L     ), 

(b)  Suppose    m^    Ts"~false.—  Then    (m^  &  k^)    is    L- false  and, 
by  Theorem  13.9,    (fumt'*   &  k^  n)~  L-implies    (fumt-  n 

&  ^  J- 

Theorem  14.5:   In  a  controversy    E     ,  .  .  .  ,  En,    if 

(1)  k^    is    L-equivalent  to    k.      , 

(2)  k^    and    m^    are  true,    and 

(3)  some  which  question    q    in    Q-         is  such  that    rrjd    is  a 

direct  answer  to    q  ,    k-        L-implies  some    direct  an- 

—      — L»H  — 
swer    d    to    q ,    and  for  every  pair  of  constants    a    and 

b    occurring  in  the  disjuncts  in  d    and    m-  ,     if    (a  =  b) 

is  true  it  is    L-true, 
then  the  news -value  of    E. i  =  0  . 

Proof:   By  Theorem  7.16,"  m^    and    d    are    L-equivalent,    so 
m^    is    L-implied  by    k^  n.  So    m^    is   also   L-implied  by 

(k[  &  umt^j).        Thus,by_Theorem  B~10,    (m^  &  fumt^)    L- 
implies  fumt^     .     Since    k-         L-implies    m^,~  (fumt^  &  k^  n) 
L-implies     fumt^       &  k^     )~  — 

It  could  happen,   of  course,   that    R    must  pay    S_    for  a  message 
which  is  merely  a  direct  answer  to  a  test  question;    this  can  oc- 
cur for  example  in  the  case   of  a  which  question,   where    certain 
constants    a    and    b    are  such  that    (a  =  b)    is  true  but  not  L_-true. 
We  should  not  be  surprised  or  dismayed  that  this  can  occur;      it 
simply  means  that  the  examiner  can  learn  something  from  the  ex- 
amined.    If    R    does  not  want  to  have  to  pay  for  answers  to   test 
questions,   he  can  confine  his  test  questions  to  disjunctive   ques- 
tions.    If  he  must  ask  -which  questions  as  test  questions    and   the 
conditions    of   Theorem    14.5   do  not   obtain,   he    can   still   cross- 
examine    S    about  any  suspicious  answer;  one  possibility  for  cross- 
examination  is  suggested  by  (4)  in  the  proof  of  Theorem  7.16. 

The  following  theorem  is  quite  important.   It  says  in  effect  that 
if    R'  s   usable  knowledge  keeps  growing  and  his  information  val- 
ues suffer  no  deflation,    then  the   amount   of  news -value   he   will 
have  to  pay  for  in  a  controversy  does   not   exceed  the   amount  of 
information  which  is  added  to  the  usable  part  of  his  initial  knowl- 
edge by  the  usable  messages  and  his  final  knowledge. 

Theorem  14.6:    Let    E  ,   .  .   .  ,  En    be    a    controversy  where, 
for  every    i    such  that    1  <  i  <  n",    Ij(p    |  p   )  ^  I.     (p    |  p  )  and 
LJfumt/'  &  ki2I)  £  Ii(fumti*  &  k^^  '^~iltJ.     TlTen 

n 

IJfumtn  &  kj  |  k1>2J  ;>  ^  k[(h,n  &  fumti,n)  I  &i,n   &  fumti*>]- 

i=i 
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Proof:   Note  first  that,   by  Theorem  11.7, 
Ii[(ki,n  &  fumti,n)  I  ^i,n  &  *>»*£)]  =  ^[fumt^  |  (k^  &  furmf)]. 

Since    (umtn  &  kn)    L_-implies    (fumtn+1*  &  knjn_),it  suffices  to 
show  that,  for  each  i   from    1    to   n  ,  ~l^[(fumt^,  *  &  k^  n)  |  k      ] 
is  an  upper  bound  of  the  sum  of  the  first    i    terms  6i  the  sufri- 
mation.     This  can  be  shown  by  induction  on    i,    as   follows: 

Base:    Let    z  =  Ijfumtj  n  I  (kj  n  &  fumt*)] 

=  I^fumt^-  &  k/-  &  fumt*)   -  LJk^  &  fumt*). 

Now   fumt*    is    (p  v  -p)    and  fumtj  n    is    fumt       * 

So    z  =  I^fumt^;*  &  k1>n  &  k      )  -^(kln) 
=  I1[(fumt1+*fcJ5|-)|]|li3. 

Step:   We   assume   that 

i-i 
li.^fumtf  &  ki:_1>31)  |  k1>n]   £  £ Ij[fumtj>s  |  (kjjE  &  fumtj*)]. 

i 
Let    z=    ^    Ij[fumtj>II|(kj)21&  fumtj*)]. 

Thus    z  <  ^[(fumtj*  &  k^J  |  kla]  +  Ii[fumtija  |  (k^  &  fumtj*)]. 
By  hypothesis  we  may  change    I.       here  to    1^  ,    so 

z  <c  I^fumt^  &  k^  &  kla)  -  LJk^) 

+  ^(fumt^  &  kisI1  &  fumtf)  -  IjO^n  &  fumti*)- 

By  hypothesis,  I^fumt^*  &  k.J  £  I^fumtj*  &  k^n  &  k^). 
So  z  ^  ^(fumtj*  &  kLa)  -  l^J  +  Ii(fumtLa  &  k"^  &  fumtf ) 
-^(k,  n&  fumtf), 


i       i>ii 


so    z  £  LJfumt^  &  ki)a  &  fumtj*)  -  liUs^). 
But   (fumt^  n  &  fumt^*)    is  L_-equivalent  to   fumt.    *  ,    and 
QSLn&^n)    L-i^plies    k^. 
So    z  <  Ii(fumti+*  &  kia  &  k12L)  -  Ljk^) 
^  I.  [(fumt.  ,  *  &  k.     )  I  k      ]  . 
$    $    $    $    $ 

In  the  system  developed  above,  actual  news-value  computations 
are  made  at  times  which  may  be  relatively  far  apart.  At  the 
time  of  any  E  •  ,  however,  R  is  free  to  estimate,  via  P^ ,  the 
likelihood  that  various  r-possible  wffs  will  be  received  as  Sii+j  • 
From  this  estimate,   and  analogous  ones  concerning    k^+j  ,    Qi+j> 
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and  I; , -,   R   can  compute  the  mathematically  expected  news-value 
of    E-~-~.    A  conjecture  system  of  a  different  sort  is  the  following: 

Definition  14.4:   fumt- /*   is 

(1)  (p  v  -p)    in  case   _i  =   1  ;    and 

(2)  (fumt     •  &  fumt     •&...&  fumt.       .)    in  case    j.  >  1  . 

Definition  14.5;   An  incipient  news -value  datum,  for     E^    in  a 


controversy    E 


E  ,  .  .  .  ,  E„    is  a  value  of 


-n 


Ii[P  I  QSi&  fumt^*)], 

where 

(1)  p  is  a  conjunction  of  statements  r_  such  that  r_  is  L- 
implied  by  (k^  &  umt^)  and  if  r_  is  not  L-true  then  r. 
is  a  direct  or  partial  answer  to  a  question  in    Q^ ,    and 

(2)  if    p    is  not   L-true  then    p    is    L-implied  by    DA,  where 

(a)  DA  is  a  direct  answer  combination  of  some   non- 
empty subset  of    Q_^,  and 

(b)  (DA  &  k^  &  umt^)   is"  consistent. 

Hereafter  we  shall  usually  omit    'in    E  ,  .  .  .  ,  En'    following 
'incipient  news -value  datum  for    E^'. 

Theorem  14.7:   In  a  controversy    E  ,  .  .  .  ,  En  the  set  of  in- 
cipient news -value  data  for    E^    has  a  least  upper  bound. 
Proof:   Incipient  news -value   data   are   values   of     I(p  &  k^ 
&  fumt-/-*)  -  I(k^  Ik  fumt^*),   hence  are  bounded  above  by 
I(p  &:  k-  &  fumt-  7.*),  and  hence  by  I(k^  &  umt^,  which  is  finite. 

Definition  14.6:   The  incipient  news-value  of    E^   in   a   con- 
troversy   E  ,  .  .  .  ,  En    is  the  least  upper  bound  of  the  set  of 
incipient  news -value  data  for     E^  . 

Hereafter  we  shall  usually  omit    'in    E  ,  .  .  .  ,  En'    following 
'incipient  news -value  of    E^'. 

This  concept  can  be  usecfby    R    to  determine  how    surprising 
the  controversy  is;    at    En    he  compares  the  incipient  news -value 
of    Ej_    with  the  actual  news -value  of    E^  .        The   justification  for 
this  interpretation  is  provided  by  the  following  theorem: 

Theorem  14.8:    In  a  controversy    E.  .  .  .  ,  En    the    incipient 
news-value  of    En    is  equal  to  the  news-value  of    En  . 
Proof:   Incipient  "news -value   data  for     E^    are  values    of 
In[p  j  (kn  &  fumtn/n*)],    where    p    is  defined  in  terms    of     Qn 
and    (k„  &  umt^.-   But  the  news -value   data   are   values  of" 
In[P  I  (kn  n  &  fumtn*)]>    where    p    is  defined  in  terms  of  QnjT1 
and"  (kn~n_&  fumt^).     Since    Q^    is     Qn  ,     k^    is    L- 
equivafent  to    k^  ,        fumtn  n    is     L, -equivalent  to    umtn  ,     and 
fumt^'     is  identical  with  fumtn/n*,    the  incipient  news -value 
data  here  coincide  with  the  news  -value  data. 

sje      ijs      sje      $:      %: 
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Suppose    R    wishes  not  only  to  pay    S     for    the    news -value    of 
usable  messages,    but  also  to  impose  penalties,    such   as    deduc- 
tions from  cash  payment.     Just  as  one    can   evaluate    a    sequence 
of  messages  on  many  dimensions,    so  one    can   penalize    on   many 
dimensions.     For  the  _I- value  dimension,     R    might  use  a  system 
based  on  the  following  concepts: 

Definition  14.7:   A  reject  of    E ^    in  a  controversy  E ,   .  .   .  ,Kn 
is  a  message  in    m  ,   .  .   .  ,  m^  "which  is  not  a  conjunct  of  umt^. 

Definition  14.8:  F  is  a  new  reject  in  E j  in  a  controversy 
E. ,   .   .   .  ,  E       if  and  only  if 

(1)  F    is  "a  reject  of    E^    in    E En  ,    and 

(2)  if    j.  >    1,    then    F    is  a  conjunct  of  umt .     . 

Definition  14.9:  The  particular  message  rejection,  pmr^,  of 
E_i  in  a  controversy  E  ,  .  .  .  ,  E_n  is  the  alphabetically  "first 
statement  which  is  formed  by  conjoining  to  (p  v  -p)  all  the 
rejects  of    E^    which  are  new  in    Ej  . 

Definition  14.10:  The  final  rejection,  fr,  in  a  controversy 
E  ,  .  .  .  ,  En  is  the  alphabetically  first  statement  which  is 
formed  by  conjoining  to  (p  v  -p)  all  of  the  rejects  of  En  in 
E^,  .  .  .  ,  Ejj. 

Definition  14.11  :  The  particular  I-penalty  basis  for  E-  in  a 
controversy    E  ,  .  .  .  ,  En    is    [(pmr^  . 

Definition  14.12:  The  final  I-penalty  basis  for  a  controversy 
I, Ea    is    I(fr). 

Definition  14.11  would  be  used  merely  for  record-keeping  pur- 
poses; a  graph  showing  particular  I-penalty  bases  would  help  to 
reveal  how  erratic  S1  s  behavior  was.  Definition  14.12  might  be 
used  for  levying  actual  cash  penalties.  Since  I(fr)  might  be  in- 
finite, R  and  S  must  agree  on  a  way  of  scaling  down  the  I- 
penalty  for  computing  cash  deductions. 

jfe        $!g        »fe        jjg        }fe 

It  might  seem  that    R    credits    m    with  too  little  news -value  in 
the  case  of  artificial  communication  events  which  are  constructed 
by    R    for  his  record-keeping  purposes,   for  example  in  accord- 
ance with  clause  (2)    of  Definition  12.5.     Suppose  that    m^     stim- 
ulates   R    to  enlarge    Q-    to    Q^"  .     We  have  assumed  informally 
that    Q-"    cannot  be    Q^  "but   may  be    Q.      ,   and  we  have  assumed 
formally  that  if    Qi+      is     Q^     then    m^     is    m^.        But   then   the 
news -value  of    E-  7.    will  be  zero,   because    m- .       is  discounted  in 
terms  of  fumt.    *,"  and  fumt.     *    contains    m^    a~s  a  conjunct.      The 
situation  is  the  same  in  all~cases  of  growth"  of  knowledge  or   in- 
terest,  or  change  of  mind.     But  this  is  no  defect  in  our    concept 
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of  news -value.     If  a  controversy  continues  past  a  growth  of  in- 
terest,   subsequent  messages  sent  by  S  will  be  rewarded  in  terms 
of  the  enlarged    Q.     A  graph  showing  news-values  might  register 
zero  during  a  sequence  of  growths  or  changes  of  mind,  but    such 
a  graph  can  be  interpreted  by    R    in  a  way  which  is   fair   to     S  , 
and    S    should  no  more  expect  pay  for  the  news -value  of  a  mes- 
sage repeated  by    R    than  for  a  message  repeated  by  himself.   If 
S    claims  that  he  should  be  paid  for  the  extension  of    Q     caused 
by    m^  ,   or  for  the  answers  to   the   questions   in  the   enlarged     Q 
which  are  answered  by    (k^  &  rmj  ,    R    can  reply  that     S     must 
prove  that  the  enlargement  of  Q    was  due  to    m^,  and  in  any  case 
the  pay  should  come  via  some  other  evaluation  "function,  measur- 
ing something  other  than  news -value .     Assuming  that    R    follows 
this  policy,   the  rational  course  for    S    is  never  to  put  all  his  eggs 
in  one  basket  —  that  is,   never  to  put  all  of  his  news  in  one  mes- 
sage.      For   his    initial  messages   he    chooses    some    statements 
which  will  arouse    R,    awaken  his  interests,    enlarge    Q.        Only 
after    Q    has  been  enlarged  does    S    communicate  the  main  body 
of  his  news.     Our  model  of  the  behavior  of    R    and    S     in  these 
respects  fits  the  case  of  many  empirical  examples.    Newspapers, 
for  example,   arouse  interest  by  their  headlines  but  they  tell  the 
complete  story  only  in  the  many  paragraphs  of  smaller   print. 

Besides  keeping  a  record  of  the  news-value  of    S's     messages, 
R    is   free   to   keep   track  of   such  other   functions   as     I(umt^)  , 
Ifum^  |  k^),    Iffumt^  n  |  (k^  n  &  fumt^*)],    and  so  on;    and  he  can  sup- 
plement his  payment  to   S  ,   as   base~d  on  news-value,   with  extra 
payments  based  on  these  other  functions.     One  system  of  evalu- 
ation which  might  be  usable  for    "tipping"    S  ,    and  might  also  be 
of  interest  in  other  respects  as  well,  is  suggested  by  the  following: 

Definition  14.13:   Let    E E^  ,..'.,  Ej_ ,.  ...  ,  E^  be  a 

controversy.     Then    E^    is  a  change  re    E-^  "if  and  only  if  E^ 
is  either  ~~ 

(1)  a  growth  of  knowledge,   a  growth  of  interest,  or  a  growth 
of  wonder  at  some    p    in    Ej En    with  respect  to 

(2)  a  change  of  mind  with  respect  to    E^  . 

Definition  14.14:   An  I-interest  datum  for    E^    in  a  contro- 
versy   E  ,  .  .  .  ,  E_k  ,  .  .  .  ,  Ej  ,  .  .  .  ,  En  ,  relative  to    E^  ,    is 
a  value  of 

Iifel(-l,n&  fumthJ)]' 
where 

(1)  p  is  a  conjunction  of  statements  r_  such  that  r_  is  L- 
implied  by  (k.  &  fumt.  )  and  if  r_  is  not  L-true 
then    r    is  a  direct  or  partial  answer  to  a  question   in 

Q.       ,    and 

— 1>H 
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(2)    if    p    is  not   L-true  then    p    is    L-implied  by  DA,  where 

(a)  DA  is  a  direct  answer  combination  of    some   non- 
empty subset  of    Q-        and 

(b)  (DA  &  k^  &  umtn)  Ts- consistent. 

Definition  14.15:     The   I-interest   of     Ej     in  a   controversy 
E  ,  .  .  .  ,  En  ,    relative  to    E^  ,    is  the  least  upper  bound  of 
the  set  of  I-interest  data  for    E^    in    E  ,  .  .  .  ,  En  ,  relative 
to    Eh  . 

Definition  14.16:    Let    E  ,   .  .  .   ,  En    be  a  controversy.  Then: 

(1)  If    E^    is  not  a  change  re    E-     ,  then  the  interest-value 
of    E^    in    EL,  ...  ,  En     is   "the  news -value  of  E^. 

(2)  If    E^    is  a  change  re    E-_  ,    then  the  inter  est -value   of 

E^    in.    E En    is   T:he  I-interest  of    E^  in   E  ,  .  .  . 

En  ,    relative  to    E^,  where    E,       ,  .  .  .  ,  E^     is   the 
longest  segment   E^ ,  .  .  .  ,  E^— of    E,,  .  .  .  ,  En  such  that 

(a)  Ek  =    E^  and  "~ 

(b)  for  all    g    such  that    j  ^  g  ^  k  ,    Eg    is  a  change  re 

Erl.      "  '      ~      '       " 

This  last  definition  could  be  used  for  analyzing  the  rhetorical 
behavior  of  S  and  possibly  inferring  his  rhetorical  plans.  For 
analyzing  the  immediate  psychological  effect  of  m  ,  .  .  .  ,  m^  on 
R  we  might  construct  definitions  analogous  to  Definitions  14.14, 
14.15,  and  14.16,  putting  'k^'  for  'k^  n',  'umt^'  for  'fumt^  n\ 
and  so  on. 

Obviously,    R    cannot  pay  cash  to    S    solely  on  the  basis  of  in- 
terest-value;   during  a  sequence  of  growths  of  interest  one  mes- 
sage will  be  repeated,  and  each  repetition  will  earn  interest-value 
which  overlaps  previously  earned  interest-value.    Therefore,    if 
R    wants  to    "tip"   S    on  the  basis  of  interest-value,   he  must  use 
some  system  beyond  what  we  have  provided  him  with  thus   far. 
The  natural  suggestion  is  that,    if    E^  ,  .  .  .  ,  E^.    is   a    sequence 
of  changes,    R    should  tip    S    according  to  the  interest  of     E;  , 
where    h  ^  j  <  k    and    E:     is  related  in  some    special  way  to 
E^,  .  .  .  ,  E-  ,.     There  are  many  problems,  however,  concerning 
what  this  special  relation  should  be.     If  changes   of  opinion  or 
growths  of  knowledge  occur  in    E ^  ,   .  .  .  ,  Ej_  ,,    the  tip  might  be 
enhanced  or  cut  down,  independently  of  whether   the    change   or 
growth  was  due  to    S's    messages.        We   might   require   that 
E k  ,  .  .  .  ,  E;    be  a  sequence  of  growths  or  changes   of  interest, 
and  that  it  be  the  longest  such  sequence  beginning  with    E^  ,   but 
again  we  have  the  problem  of  separating  the  growths  and  changes 
caused  by    S^   from  those  caused  by    R    himself.     It  would   seem 
that,   at  this  stage  of  our  analysis,    S    must  be  content  with  what- 
ever ad  hoc  tipping    R    cares  to  indulge  in. 


15.   MEANING 


In  this  section  we  shall  use  the  apparatus  developed  in  preced- 
ing sections  to  define  several  concepts  which  might  be  relevant 
to  the  theory  of  meaning.  These  concepts  might  in  particular  be 
useful  to  S  in  planning  rhetorical  strategies  against  R;  con- 
versely they  might  be  useful  to  R  in  analyzing  the  effect  of  S's 
communication,  and  in  analyzing   S's    communicative  style  in  general. 

We  shall  use    'meaning1   here  as  a  theoretical  term  rather  than 
an  observational  term.1    We  assume  that  different  inquiries  with 
different  purposes  may  introduce  different  concepts  of  meaning.2 
We  do  not  claim  that  our  concepts  will  be  useful  in  other  areas; 
in  particular,  we  do  not  claim  that  they  will  be  useful  in  the  anal- 
ysis of  belief  or  translation  situations.       We    shall  proceed  with 
the    general  aim  of  defining   types  of  meaning;    but  if  the  reader 
believes  that  what  we  are  defining  are  not  types  of  meaning,  then 
he  may  substitute  some  other    expression  for    'meaning'.         Our 
main  objective  in  this  section  is  not  to  define  concepts  and  argue 
that  they  are  concepts  of  meaning;    our  main   objective  is  to  es- 
tablish  Theorems   15.4,    15.5,   and  15.6      These   theorems   are 
phrased  in   "meaning"   terminology,  and  we  hope  that  they  provide 
a  connection,   however  tentative  and  partial,   between  our  theory 
of  news-value  and  the  theory  of  meaning;  but  their  most  important 
function  is  to  express  a  certain  property  of  news-value  and  thereby 
help  to  justify  our  analysis  of  news -value.   It  will  become  evident 
that,  even  if  our  "meaning"  terminology  were  omitted  altogether, 
Theorems  15.4     to  15.6  would  still  retain  their  validity  and  interest. 

The  general  type  of  meaning  we  shall  concentrate  on  in  this 
section  is  contextual  meaning  —  that  is,  the  meaning  of  a  mes- 
sage to  R  in  a  given  communication  context.  In  the  first  few 
definitions  we  shall  confine  our  attention  to  logically  extractable 
meanings.  Our  guiding  ideas  here  are  that  the  logically  extract- 
able  meaning  of  m  is  the  wffs  which  can  be  logically  extracted 
from  m,  and  that  different  methods  of  extraction  define  differ- 
ent subtypes  of  meaning. 

Definition  15.1:  The  answer-set  of  Q  is  the  set  consisting 
of  all  L-true  wffs  and  all  direct  or  partial  answers  to  ques- 
tions in  Q.  Given  a  set  Q,  we  shall  denote  its  answer-set 
by   'AQ'.~  "~ 


1  In  the  sense  of  Hempel  [1952]  . 

2  For  a  defense  of  this  assumption  see  Osgood  [1957],  especially  pp.  2ff. 
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Definition  15.2:     Let    E^     be    a   member    of   a    controversy 
E^  .  .  .  ,  Ea.     Then: 

(1)  The  first-order  k-meaning  of    m^    in    Ej   in   E 

Ejj    is  the  set  consisting  of  all  wffs   which  are    L-im- 
plied  by    (k^  &  rrij)  . 

(2)  The  second-order  k-meaning  of    m^    in    Ej    in    E  ,  .  .  .  , 
En    is  the  set  consisting  of  all  wffs  which  are  L-implied 

by~  (ki  &  Slj  &•  •  •  &  2ii)- 

(3)  The  third-order  k-meaning  of    m^    in    E^    in    E ,  .  .  .  , 
En    is  the  set  consisting  of  all  wffs  which  are  L-implied 
by"  (ki  &:  umpi  &  m^)  . 

(4)  The  fourth-order  k-meaning  of    m^    in    Ej   in   E. 

En    is  the  set  consisting  of  all  L-true  wffs  in  case    m^ 
is  not  in  umtj   and  all  wffs   which  are    L-implied  by 
(k^  &  umtjj    in  case    m^    is  in  umt^. 

(5)  Tne  fifth-"order  k-meaning  of    Hli~i£    E_i    in    Ej,  .  .  .  , 
En    is  the  set  consisting  of  all  L-true    wffs  in  case  m^ 
is- not  in  fumt-        and  all  wffs  which  are  L-implied  by 
(— i  n  ^  fumti  n)~"  ^n  case    331i    ^s  ^n  fumt^  n  . 

Hereafter  we  shall  usually  omit   'in    E^    in  E En'    fol- 
lowing   'k-meaning  of    m^' . 

Definition  15.3:   Let    E^    be   a   member    of  a   controversy 
Ej,  .  .  .  ,  Ea.     Then: 

(1)  The  fir  st -order    Q-meaning  of    m^    in    E^    in   E  ,  .  .  .  , 
En    is  the  intersection  of    AQj     and  the  first-order   k- 
meaning  of    m^. 

(2)  The  second-order  Q-meaning  of  m^  in  Ej  in  E,,  .  .  .  , 
En  is  the  intersection  of  AQj  and  the  second-order  k- 
meaning  of    m.j. 

(3)  The  third-order    Q-meaning  of    m^    in    E^    in    Ej,   .  .  .  , 
E^    is  the  intersection  of    AQj     and  the  third-order    k- 
meaning  of    m^. 

(4)  The  fourth-order    Q-meaning  of    m^    in   E j    in   E 

En    is  the  intersection  of    AQj    and~the  fourth -order  k- 
meaning  of    m^. 

(5)  The  fifth-order    Q-meaning  of    m^    in  Ej   in  E  ,  .  .  .  , 
Ejj    is  the  intersection  of    AQ-;  n  "and  the  fifth-order  It- 
meaning  of    m^.3 

Hereafter  we  shall  usually  omit   'in    E^    in    E. En'    fol- 
lowing   'Q-meaning  of    m^' . 

In  both  of  the  last  two  definitions  the  order  of  the   "orders"   is 
intended  to  be  an  epistemological  order,  rather  than  a  logical  one. 
Logically,   all   of  the   orders   are    contained  in  the    second   one. 
Epistemologically  the  orders  represent  progressive  refinements 


The  form  of  Definitions  15.1  to  15.3  was  suggested  by  Belnap. 
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of  analysis;   R    receives    m^    and  then  successively  considers  its 
meaning  in  relation  to  (1)  itself,    (2)   all  the  other  messages,  (3) 
umpi  ,    (4)    umtj  ,    and  (5)    fumt-  n  . 

It  Is  perhaps  worth  noting  that  a  message    m^    can  be    "mean- 
ingful"  at  some  levels  but   "meaningless"   at  others,    in  the   fol- 
lowing sense:    Suppose    m^    is  neither    L-true  nor    L-false,   and 
that  m-    is  not  in  umtj  .     Then  the  first-,  second-,  and  third-order 
Q-mea~nings  of    m^    might  consist  of  more  than  just  L-true  wffs, 
while  the  fourth-  and  fifth-order    Q-meanings  consist  only  of  L- 
true  wffs.     On  the  other  hand,    every  message    "has  meaning"   in 
the  sense  that  its  first-,  second-,  third-,  fourth-,  and  fifth-order 
k-  and  Q-meanings  are  all  nonempty;    they  all  include  the  infin- 
itely many  L-true  wffs. 

Can  we  analyze  the  meaning  of    m    itself?    That  is,    can  we  iso- 
late that  part  of  the    Q-meaning  of    m    which,   although  extracted 
by  means  of    k    and  other  messages,    is  nevertheless  contributed 
by    m    itself?    As  a  first  step  we  offer  the  following  definitions: 

Definition  15.4:   Let    E-    be   a  member    of  a   controversy 
E  ,  .  .  .  ,  En  .     Then,  for  every    y    such  that  1  ^  y  ^  5,   a 

datum  for  the  I- contribution  of    m^    to   the    y order  Q- 

meaning  of    m^    in    Ej    in    E  ,  .  .  .",  En    is    a  value   of 
I-(p  |  s)  ,   where    p    is  a  conjunction  of  statements    r_    such 
that    r_    is   in  the   y       order    Q-meaning  of    m^    and  if     r_ 
is  not  L-true  then    r_    is  not  L-implied  by    s_,   and 

(1)  if    y  =  1,    then     s_    is    k^    and    p    is  L-implied  by    some 
DA    of    Qj    such  that  if"  p    is  consistent  so  is     DA, 

(2)  if  y  =  2, "then  s_  is  (k^  &  rr^  &.  .  .&  rn^)  and  p  is 
L-implied  by  some  DA  of  Q^  such  thaf  if  p  is  con- 
sistent so  is     DA, 

(3)  if    y  =  3  ,  then    s_    is     (k^  &  umpj)     and    p    is    L-implied 
by  some     DA    of    Q;     such  that  "if    p    is  consistent    so 
is     DA, 

(4)  if    y  =  4  ,    then    s_    is    (k^  h  fumti fy   )    and  if    p    is   not 
L-true  then    p    is    L-irriplied  by    some     DA    of   some 
nonempty  subset  of    Q^    such  that    (DA  &  k  •  &  umt^)  is 
consistent,    and 

(5)  if    y  =  5  ,    then     s_    is     (k^       &  fumt*)     and  if    p    is   not 
L-true  then    p    is    L-irripTied  by    some     DA     of   some 
nonempty  subset  of    Q-         such  that    (DA  &;  k     &  umt   ) 
is  consistent.  ~ 

We  shall  abbreviate  the  above  five  definienda  by   'datum  for 
the   1 -I- contribution  of    m^',    'datum  for  the  2-I-contribution   of 
m-',    and  so  on. 

Theorem  15.1:   Data  for  the  1-,    2-,    and  3 -I- contributions  of 
m-     range  between    0     and    oo  ,    inclusive. 
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Proof:   Suppose  that  the     s_    in  Definition    15.4   is    consistent. 
It  might  L-imply    m- ;    on  the  other  hand,    m.j    might  _L-imply 

some    L-false  direct  answers  to  questions  in    Q. 

— : i>  EL 

Theorem  15.2:    Data  for  the  4-_I- contribution  of    m.j    can  be 
zero  but  are  bounded  above  by    Ij(umtj  &  k-)  . 
Proof:    The  data  can  all  be    zero  "if    rn^    is  "hot  in  umt-  .      In 
any  case,   p    is  L- implied  by    (umt-    &~kj),  which  is  consistent. 

Theorem  15.3:   Data  for  the  5 -I- contribution  of    mj    can  be 
zero  but  are  bounded  above  by    L(fumt-       &  k-     )  . 

Definition  15.5:    Let    E-     be    a   member    of  a    controversy 


E: ,   .  .  .  ,  En  .      Then ; 


(1)    The  4-I-contribution  of    mj    in    Ej    in    E  , 


E 


— n 


is  the  least  upper  bound  ofthe  data  for  the  4-I-con- 
tribution of    m-  . 
(2)    The  5-I-contribution   of     ny     in    Ej    in    E.,   .   .  .  ,  En 
is  the  least  upper  bound  of  the  data  for  the  5-I-con- 
tribution of    mj  . 

Hereafter,   we  shall  usually  omit   'in    Ej    in    E_  ,  .  .  .  ,  E   ' 
following    'contribution  of    m_j'. 

Theorem  15.4:    In  a  controversy    E  ,  .  .  .  ,  En  ,    every  da- 
tum for  the  4-I-contribution  of    m-     is  an  incipient  news- 
value  datum  for    E  j  ,    and  every  datum  for  the  5-I-contribu- 
tion of    mj    is  a  news -value  datum  for    Ej  . 
Proof:    This  follows  directly  from  the  definitions  involved. 

Theorem  15.5:  In  a  controversy  E  ,  .  .  .  ,  En  ,  if  either  mj 
is  in  umtj  or  umtj  is  L-equivalent  to  umt-  ,  then  the  in- 
cipient news-value~of  Ej  is  identical  with  the  4-I-contribu- 
tion of    m-  . 

Proof:   In"view  of  Theorem  15.4,    it  suffices  to    show  that, 
under  the  conditions  of  the  theorem,    every  incipient  news- 
value  datum  is  a  datum  for  the  4 -_I- contribution.     Let   z^  be 
any  incipient  news -value  datum    I-[p  |  (kj  &  fumtj/j*)].     Now 
either    p    is    L-implied  by    (k-  &  fumtj /*")    or  it  is  not.  Sup- 
pose it  is;    then    z  =  0    and    z~  thus   ii  "identical  with  a  zero 
value  in  the  set  of  data  for  the  4-I-contribution.       Suppose  it 
is  not;    still,    it  is  L-implied  by    (k-  &  umtj)  .     Now  if    m.j   is 
in    umtj  ,    then  the  fourth-order  Q-meaning  of    mj    includes 
direct  and  partial  answers  to  questions  in    Qj  which  are    L- 
implied  by    (kj  &  umtj),    so    z    is  a  datum  for"the  4-I-con- 
tribution.    If  "umtj   is  L-equivalent  to    umt-     ,   then  fumt.      • 
is    L-equivalent  to   umtj  ,    and  thus    (kj  &  fumtj /j'1 )    L- im- 
plies   (kj  &  umtj)  ,    so    \  =  0    again. 

Theorem  15.6:   In  a  controversy    E_  ,  .  .  .  ,  En  ,    if  either 
m-     is  in    fumt;  _    or     fumt:  „    is    L-equivalent  to    fumt.  ,      , 
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then  the  news -value  of    E^    is    identical  with  the    5_-I-con- 

tribution  of    m^  . 

Proof:   Similar  to  that  of  Theorem  15.5. 

***** 

If    R    is  a  free  agent,    and  especially  if    S    must  sell  his   mes- 
sages to    R,    then    R    can  claim  that   Q-meanings    are   the    only 
meanings  he  is  interested  in.     If    R    is    subordinate   to     S  ,    and 
especially  if    S's    messages  come  as  commands  to    R,    then     S 
can  claim  that  there  is  more  to    m    than  just    R's    Q-meanings. 
For  example,     S    will  claim  that    R    must  respect  the  k-mean- 
ings  of    m.      This  claim  is  not  too  harsh,    for    S    realizes  that 
these  k-meanings  are  limited  by  the  content  of    R's    k  . 

The   k-  and   Q-meanings  defined  above  are  meanings    of    m    to 
R    in  relatively  small  communication  contexts.  It  might  be  urged 
that  meanings  in   "real  life"    are  relative  to  contexts   which  are 
much  wider  than  the  particular    controversies   being    considered 
here.     Can  our  analysis  provide  such  wider  contexts? 

One  approach  to  this  problem  is   provided  by  our   analysis    of 
tradition:    We  can  take  traditions  to  be  contexts  which  determine 
meaning.     Reserving    'm^',    'L^',    etc.  ,    for   the   members   of  a 
communication  event,   let  us  use    'mz^',    'Lzi',    etc.,    to  refer   to 
the  first,    second,    etc.  ,    members  of  the  tradition  event    Z^  .    Our 
first  step  would  be  to  define  for  histories  [recall  DefinitioiTl2.il] 
a  message -processing  procedure  analogous   to   that  for    contro- 
versies.    In  particular  we  would  define   the    concepts    of  umtzj  , 
fumtzi  n,    etc.     Then  we  would  define  the  traditional    k-   and  Q- 
meanings  of    m^    to    R    in  such  a  way  that  we  could  secure,    for 
example,    the  following:    Suppose     Z^,  .  .  .  ,  Zn   is  a  history  where 
some     Z-      among     Z  ,     .  .  .  ,     Zn    is    a   tradition   event   at    t    for 
E  ,   .   .  .  ~,  E       and  some    E^    among    E  ,     .  .  .   ,     Er     is  a  commu- 
nication event  for    R    at    t  ,    and  where    m-     L- implies    mzi .  Then 
the  fifth-order  traditional   Q-meaning  of    m^   in    Z_-j    in    Z_  7~.  .   .  >  Zlt\ 

is  the  set  consisting  of  all   L-true  wffs  in  case    m~  ■      is   not   in 
o  —  — zj 

fumt„:        and  all   L-true  wffs  and   all   direct  or   partial  answers 
p     to  "questions    in     Q    •  such   that     p     is    L-implied  by 

(-zj»n   &  fumtz|,n)     ^ "case    m^    is  in   furnt^. 

In  empirical  applications  of  our  analysis  of  tradition  meanings, 
'Z.,*  •  •  •  >  Z.n'  would  correspond  to  the  name  of  the  tradition.  For 
example,  a  historian  might  group  Z.  ,  .  .  .  ,  Z.n  together  and 
say  that  these  constitute  the  Malthusian  tradition  in  British  thought, 
and  we  could  then  interpret  'the  traditional  Q-meaning  of  m^  in 
Z,j  in  Z  i  .  .  .  ,  Z.n'  as  'the  Q-meaning  of  m^  in  the  Zj  phase 
of   the  British  Malthusian  tradition'. 

Allowing  rm  to  be  different  from  mz^  would  secure  a  useful 
generality  for'our  analysis.  The  group  of  R's  involved  in  Zx 
at    t     would  agree   to   focus   attention   on  a  publicly  inspectable 
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message,     rnzj.       Then    m^    would  be    a  large   part  of    R '  s    total 
thought,     including  both   m^j  and  his  private  interpretations  and 
elaborations  of  it.    Whatever     mi    might  be,  it  would  have  mean- 
ing in  the  tradition  only  by  way  of  the  usable  messages  in  the  se- 
quence   m^,   .  .  .  ,  m^. 

sic      A      sk      rfc      sic 

Finally,    to  suggest  still  further  possibilities  of  meaning  analy- 
sis,   we  shall  define  still  another  type  of  meaning. 

Definition  15.6:    The  depth-meaning  of    m^    in    E^    in  a  con- 
troversy   E  ,  ....  Ej_ ,  ...  ,  E^+rn  ,  .  .  .  7  En  ,    where 
l<Ci<i+m^:n,    is  "the  ordered  (m  +  1) -tuple    <E^  ,    E..     , 
.  .   .  ,     E  j+rn  >     if  and  only  if 

(1)  for  all    j     such  that    i  <  j  ^  i  +  m,    E;    is  a  change   re 
E •     ,    and 

(2)  if"  i  +  m  <  n,    then    E-  is  not  a  change  re    E-,       . 

Because  of  clause  (2)  of  Definition  12.5,    if    <E^  ,    .  .  .  ,    Ej+rn> 
is  the  depth-meaning  of    m^    in    E^  ,   then  for  every    h    such  that 
1  ^  2l  <  i+IS>    Hih  =  £3d  •     Thus  we   may  intuitively  think   of  the 
depth-meaning  of"  m^  "as  a    "thrill"   which,    initiated  and   sus- 
tained by    m^  ,    ripples  through    R    during  the  course  of  E^  ....  , 
E_i+m  .     Depth-meanings  are  personal  in  the  sense  that  they  are 
n -tuples  of  communication  events  for    R  .       But  depth-meanings 
are  not  intrinsically  unique,    secret,    or  private,   for  two   differ- 
ent   R's    can  enjoy  communication  events  whose  component  m's, 
L's,   and  so  forth,   are  respectively  identical.     Concerning   pos- 
sible applications  for  our  concept  of  depth-meaning,  we  have   at 
present  no  claims  or  conjectures;    perhaps  the  concept  will  have 
more  interest  when  we  add  to  our  model  a  utility  function  with  a 
status  similar  to  that  of  _I .     And  perhaps  we  may  eventually  be 
able  to  relate  our  analysis    of  depth-meaning   to   the   assertions 
put  forward  in  more  literary  treatments.4 


See,    e.g.,    Wheelwright  [1954], 


16.    MESSAGE   ADEQUACY 


It  is  commonly  believed  that,  when  S  uses  m  to  express  his 
thoughts,  describe  the  world,  or  persuade  R,  m  can  be  more 
or  less  adequate  as  an  expression,  description,  or  exhortation. 
In  this  section  we  shall  try  to  explicate  the  notion  of  adequacy  — 
or  rather,  since  we  assume  that  there  might  be  many  types  or 
dimensions  of  adequacy,  we  shall  try  to  analyze  several  types 
of  adequacy. 

Definition  16.1 :    Let    E  ,  .  .  .  ,  En    be   a   controversy,    let 
M(nii)    be  the  news -value  of    Ej   in    E  ,  .  .  .  ,  En  ,  and   let 

z.    be  the  sum  of  the  news -values    of     E Ej     in 

E  ,  .  .  .  ,  En  .     Then  the  sequential  I-importance  of    m^    in 
E^    in    E  ,  ."",  .  ,  En    is  the  ratio  of    [i  •  N(m.i)]     to     z_    In 
case    z>  0,    and  0~ otherwise. 

The  sequential  I-importance  is  the  degree  to  which    m^   "pulls 
its  weight"   in  the  message  sequence.      Where  all  the    m' s    have 
the  same  news -value,   the  sequential   I-importance  of  each  =  1  . 
If  no    m    except    m^    has  any  news -value,   the    sequential   I- 
importance  of  m^  =~i  .   If  M(nii)  =  0  ,  the  sequential  I-importance 
of    mi  =  0  . 

Definition  16.1  defines  a  very  simple  kind  of  adequacy;  it 
might  be  used  by  S  when  he  is  interested  in  constructing  an 
economical  message  sequence.  A  different  type  of  adequacy  will 
interest  S  and  R  when  S  wants  to  satisfy  R's  hunger  for  news 
and  R_  wants  to  evaluate  S  as  a  source  of  news.  Here  R  and 
S  must  consider  first  the  adequacy  of  S  in  meeting  R's  test 
questions,  and  second  the  adequacy  of  the  news  brought  by  S's 
actual  messages,  relative  to  the  news  which  would  be  brought  by 
a  perfect  message.  To  analyze  the  second  type  of  adequacy  we 
offer  the  following  definitions: 

Definition  16.2:    In  a    controversy    Ej,   .  .  .  ,  En  ,    Qi>nv    is 
the  set  formed  from   Q.        by  omitting  all  questions    q    such 
that  either 

(1)  some  direct  answer  to    q    is  L-implied  by  (kjjn  &  fumtj' ),  or 

(2)  q    has  no  true  direct  answers. 

Thus  Q.  *  might  be  empty,  but  if  it  has  members  then  each 
of  them  ha~s~a  true  direct  answer  and  none  of  them  has  direct  an- 
swers which  are  L-implied  by    (k^       &  fumt*). 
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Definition  16.3 :  A  news -adequacy  datum  for  m^  in  Ej  in 
a  controversy  Ej ,  .  .  .  ,  En  is  z.  if  and  only  if  Qi)rf~  is 
nonempty  and     z    is  a  value  of 

Ij[P  1  (fej,n  fe  ^™tj)] 
lit  DA  |  (kijS  &  fumtf )] 

where 

(1)  DA    is  a  true  direct  answer  combination  of    Q-     *  , 

(2)  Ii[DA  |  (ki,n  &  fumt*)]  ±  0  ,   and 

(3)  p    is  a  conjunction  each  of  whose  conjuncts  is  a  state- 
ment and  either  is    L-true  or  is  a  direct  or  partial  an- 
swer   r_   to   a   question  in     Q^  nv     such  that     r_    is   L- 
implied  by  both    DA    and    (k^  n  &  fumt^  n)  . 

For  the  type  of  adequacy  in  question  here,   we  can  let  the  ade- 
quacy equal  zero  in  the  case  where    Q^  n*    is  empty.  What  of  the 
case  where    LjDA  |  (k-       &  fumt*)]  =  0~?— This  case  does  not  arise 
for    Qj  n*    if  "I    is  sucn~that     if  "I(p    &  P2)  =  KEj)    then     p       L- 
implie¥~p  .     But  suppose    IjjDA  |  (k^n  &  fumtj*)]  =  0  ;    DA     L- 
implies    p  ,     so  by  Theorem  11.6,   Ij[p  |  (kj  n  &  fumtj  )]    =  0  ,     so 
we  can  let  the  adequacy  in  this  case  be  zero.     The  next  step  is  to 
combine  cases,   as  follows: 

Definition  16. 4:  The  news-adequacy  basis  for  m^  in  E.;  in 
a  controversy  E  ,  .  .  .  ,  E  is  the  set  consisting  of  zero" and 
all  the  news -adequacy  data- for    m^    in    Ej    in    E  ,  .  .  .  ,  En  . 

Theorem  16.1:   The  news-adequacy  basis  for    m^    in    E^    in 
a  controversy    E  ,  .  .  .  ,  E_n    has  a  least  upper  bound. 
Proof:   The  basis  is  nonempty,    since  it  includes  zero.  Since 
DA    L-implies    p,   every  news -adequacy  datum  ^  1;  see 

Theorem  11 .6. 

Definition  16.5:    The  news -adequacy  of    m^    in    Ej    in  a  con- 
troversy   E  ,   .   .   .   ,  En    is  the  least  upper  bound  of  the  news- 
adequacy  basis  for    rm_    in    E^    in    E  ,   .  .  .  ,  En . 

Hereafter  we  shall  usually  omit  'in  E^  in  E  ,  .  .  .  ,  En'  fol- 
lowing 'news -adequacy  datum  for  m^1  or  'news -adequacy  basis 
for    m^'    or    'news-adequacy  of    m-1. 

Theorem  16.2:  In  a  controversy    E  ,  .  .  .  ,  En  ,    if   (k^n  &  fumt^) 
L-implies    (k^       &  fumt^  n)  ,   then  the  news -adequacy  of    m^  =  0. 
Proof:   Analogous  to  that  of  Theorem  14.2. 

Theorem  16.3:   In  a  controversy    EL,  ...  ,  En  ,   if    kj    is  L- 
equivalent  to    k-        and    (m-  &  k^)    is  L- false, "then  the  news- 
adequacy  of    m^  =  0  . 
Proof:   Analogous  to  that  of  Theorem  14.3. 
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Theorem  16. 4:    In  a  controversy    E  ,   ....  En  ,    if 

(1)  k^    is  L_- equivalent  to    k-      , 

(2)  k^    is  true,    and 

(3)  some  disjunctive  question  q  in  Q^  n  is  such  that  m^ 
is  a  direct  answer  to  q  and  k-  L-implies  some  di- 
rect answer     d    to    q  , 

then  the  news -adequacy  of    m;  =  0  . 
Proof:   Analogous  to  that  of  Theorem  14.4. 

Theorem  16.5:   In  a  controversy    E,,  .  .  .  ,  E _,    if 

(1)  k-     is  L.-equivalent  to    k.      , 

(2)  k.      and    m-     are  true,   and 

(3)  some  whicE  question  q  in  Q^  n  is  such  that  m^  is  a 
direct  answer  to  q,  k^  n  L_- implies  some  direct  an- 
swer d  to  q  ,  and  for~e~very  pair  of  constants  a  and 
b  occurring  in  the  disjuncts  in  d  and  m^  ,.  if  (a  =  b) 
is  true  it  is  L-true, 

then  the  news -adequacy  of    m^  =  0  . 
Proof:   Analogous  to  that  of  Theorem  14.5. 

Theorem  16.6:   In  a  controversy    E  ,  .  .  .  ,  En    it  is  possible 
that  the  news -adequacy  of    m-  =  0     and  the   news -value   of 
E_i  *  0  . 

Proof:    Q-        might  have    DA's    which  are  consistent  with 

— i,n  °  

(k^  &  umtnJ"  but  not  have  any  questions  with  true   direct  an- 
swers. 

Theorem  16.7:    Let    E  ,  .  .  .  ,  En    be  a  controversy  where 
Qi  n'    *s  nonernPty  and    Qj  n      ^as  a  true     DA     such  that 
~(T)    LjDA  |  (kia  &  fumtf j]    +   0  ,    and 
(2)    DA    is  consistent   with    k^    and  with    (kn  &  umt^)  . 
Then  it  is  possible  that  the  news -adequacy  of    m^  =  1. 
Proof:    For    m-     choose     DA.     Since     DA    is  consistent  with 
k^    and  with    (k^  &  umtjj)  ,     m.j    is  in  fumt^  n    and   hence    is 
L7-implied  by  it.     So     DA    is  among  the    p's     of   Definition 
16.3,    and  thus  the  news -adequacy  basis  for    m^    includes    1 
as  a  value. 

Just  as  we  have  defined  news -adequacy  corresponding  to  news- 
value,  so  we  may  use  similar  methods  to  define  interest-adequacy 
corresponding  to  interest-value.  We  would  first  define  'I-interest 
adequacy  datum  for    m^    in    E^    in  a  controversy    E ,  .  .  .  ,  En  , 
relative  to     Ei*     then    'I-interest  adequacy  basis  for     m.j    in    E^ 
in    E  ,   .   .   .  ,  En  ,    relative  to    E^',    'I-interest  adequacy~of     mj~ 
in    E;     in    E  ,   .  .   .  ,  En  ,    relative  to    Eh',     and   then    'I-interest 
adequacy  of    m^    in    E^    in    E  ,   .  .  .  ,  E~',    this  last  definition  hav- 
ing a  rationale  "corresponding  to  that  of  Definition  14.16.  The  con- 
cept of  the  interest  adequacy  of  individual   messages    is    of  little 
interest  to  us  at  this  stage;     a  concept  of  greater  interest  will  be 
given  by  Definition  16.9. 
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Definition  16.6:  In  a  controversy  Ej,  .  .  .  ,  E^,  Qi,n  is 
the  set  formed  from  Q-  by  omitting  all  questions  which 
have  no  true  direct  answers. 

Definition  16.7:   A  datum  for  the  cumulative   Q-adequacy   of 
a  controversy    Ej,  .  .  .  ,  En    at    E^    is     z    if  and  only  if 

(1)  Qi  n**    is  nonempty,  ~ 

(2)  Q7  „**    ^as  no  L-true  direct  answer  combinations,  and 

(3)  z    is  a  value  of 

Ii<P) 


li(PA) 

(a)  where    DA    is  a  true  direct  answer  combination 

of    Qi  n**  '    and 

(b)  where'- p    is  a  conjunction  each  of  whose  conjuncts 

is  a  statement  and  either  is  L-true  or  is  a  direct 
or  partial  answer     r    to  a  question  in    Q^  nv*    such 
that    r_    is  L-implied  by  both    DA    and   (^f~n  &  fumt^  n), 

Definition  16.8:    The  basis  for  the  cumulative  Q-adequacy  of 
a  controversy    E  ,   .   .   .  ,  En    at    E^    is  the  set  consisting    of 
zero   and  all  the   data   for  "the    cumulative   Q-adequacy  of 
Jkj  >  •  •  •  »  En    at    E^  . 

Theorem  16.8:    The  basis  for  the  cumulative  Q-adequacy  of 
a  controversy    E ,   .  .  .  ,  En    at    Ej    has  a  least  upper  bound. 
Proof:    The  basis  has  zero"j~  so  is  nonempty.     In  every  datum 
z_    in  the  basis    DA    L_-implies    p  ,    so    I^(DA)  $-  I^(p)  ,    and 
thus     z_  ^  1 . 

Definition  16.9:    Let    E  .....  En    be  a  controversy.     Then: 
(1)    If    Q-     **    is  nonempty  and~has  L-true    DA's  ,  then  the 

cumulative  Q -adequacy  of    E  ,   .  .  .  ,  En    at    E ^  =  1 . 
(Z)    In   all   other    cases,    the    cumulative    Q-adequacy   of 

E  , En    at    E^    is   the   least  upper   bound  of  the 

basis  for  the    cumulative  Q-adequacy  of   Ex,   .   .   .  ,  En 
at    E^  . 
We   shall  say  that    S    is  completely  adequate  in   Ep   .  .  .  ,  E^ 
just  in  case  the  cumulative  Q-adequacy  of    Ej,   .  .  .  ,  En     at 
En  =  1  ,    and  completely  inadequate  in    Elf  .  .  .  ,  Efi    just   in 
case  the  cumulative  Q-adequacy  of    E  ,   .  .  .  ,  E^  at   En  =  0  . 

Clause  (1)  of  Definition  16.9  may  seem  objectionable,  on  the 
ground  that  in  this  case  the  adequacy  of  E  .....  E|  depends  on 
nothing  but  the  composition  of  Q^  n.  If  R  wants  to",  of  course, 
he  can  easily  secure  that  this  case"~hever  occurs.  But  it  need  not 
inconvenience  R  if  the  case  does  occur,  for  it  is  S  who  gains 
by  a  high  adequacy  score,  and  thus  R  can  put  the  burden  on  S 
to  prove  that    Q-    **    has     L-true     DA's. 
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At  the  time  of    En  ,    R    might  not  know  what  the  true    DA*  s     of 
Q-     *    or     Q-     **    are;    he  might  find  this  out  later,  or  never.    Thus 
the  "computation  of  news  -adequacy  and   cumulative    Q-adequacy 
might  be  considerably  delayed.     To  occupy  himself  while  waiting, 
R    might  compute  a  different  sort  of  adequacy,   defined  as  follows: 

Definition  16.10:   A  datum  for  the  possible -answer  adequacy 
of    m_i    in    E^    in  a  controversy    E  ,   .  „  .  ,  En    is     z^    if  and 
only   if 

(1)  Qi  n    is  nonempty, 

(2)  Qi  n    has  no  L_-true    DA's  but  does  have  consistent 
DA'  s  ,    and 

(3)  _z    is  a  value  of 

Ii(p) 


li(DA) 

where 

(a)  DA    is  a  consistent  direct  answer  combination  of 

Qi~n>    and 

(b)  p    is  a  conjunction  each  of  whose  conjuncts   is   a 

statement  and  either  is  L-true  or  is  a  direct   or 
partial  answer    r    to  a  question  in    Q^  n    such  that 
r    is  L- implied  by  both    DA    and    (k-       &  fumt-     ). 

Definition  16.11:    The  basis  for  the  possible-answer  adequacy 
of    m^    in    E^    in  a  controversy    E  ,   .  .  .  ,  En    is  the  set  con- 
sisting of  zero  and  all  the  data  for  the  possible-answer  ade- 
quacy of    m^    in    E^    in    E  ,  .  .  .  ,  En  . 

Definition  16.12:    Let    E  ,   .   .  .   ,  En    be  a  controversy.    Then: 

(1)  If    Qj  n    is  nonempty  and  has"  L-true     DA's,    then  the 
possible-answer  adequacy  of    m-     in    E{   in   E  ,   .  .  .  , 

(2)  If- Q.         is  nonempty  and  has  consistent    DA's    but   no 
L-true~  DA's,    then  the  possible -answer  adequacy   of 
raj    in    E^    in    E  ,   .  .   .  ,  En    is  the  least  upper   bound 
ofthe  basis  for  the  possible -answer  adequacy  of     m^ 
in    E^    in    E  ,   .  .  .  ,  En  . 

(3)  In  alf  other  cases  the  "possible -answer  adequacy  of  m^ 
in    E^    in    E  ,   .  .  .  ,  En  =  0  . 

Perhaps    R    also  wishes  to  compute  a  slightly  stronger  kind  of 
adequacy,    which  we  might  call  the  admissible -answer    adequacy 
of    m^  .     This  would  be  similar  to  the  possible-answer    adequacy, 
excepl  that  the    DA's    would  have  to  be  consistent  with    (kn  &umtn), 

The  following  theorem  might  be  useful  to  S  when  he  must  de- 
cide whether  to  scatter  his  news  over  a  message  sequence  or  put 
it  all  into  one  message. 
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Theorem  16.9:    Let    E  ,  .  .  .  ,  En    and    Es    be  two  contro- 
versies,   such  that 

(1)  %=    L_s. 

(2)  k-         is     L-equivalent  to    ks  , 

(4)  Jf=-ls      - 

(5)  fumt-         is  L- equivalent  to    mg  . 
Then:  "'""  ~ 

(a)  The  news -value  of    E^    in    E  ,  .  .  .  ,  En    might  not  be 
equal  to  the  news -value  of    Eg     in    E  s . 

(b)  The  news -adequacy  of    m^    in    E^    in    EL,   ...  ,  E_n 
might  not  be  equal  to  the  news -adequacy  of    ms     in   Es 
in    Eg. 

(c)  The  interest-value  of    E^    in    Ej,   .  .  .  ,  En    might  not 
be  equal  to  the  inter  est- value  of    Es    in    Es  . 

(d)  If    m^    is  in   fumt^  n    and    mg     is  consistent  with     ks  , 
then  the  fifth-order    Q-meaning  of    m.j    in    E^    in 

E  ,   .  .   .   ,  En    is  equal  to  the  fifth-order  Q-meaning  of 
ms     in    Es     in    Es . 

(e)  The  cumulative  Q-adequacy  of    E  ,  .  .  .  ,  En    at    E^    is 
equal  to  the  cumulative  Q-adequacy  of    Es    at    Es  . 

(f)  The  possible -answer  adequacy  of    m^    in    E^     in 

E  ,   .   .   .  ,  En    is  equal  to  the  possible -answer  adequacy 

of    ms     in    Es     in    Es  . 
Proof  of  (a)~and  (b):   News-value  and  news-adequacy  are  de- 
fined in  terms  of    I[p  |  (k-       &  fumt •*)],    where    p    is  L-implied 
by    (ki  n  &  fumt^  n)  .     Now~fumts*  "is     (p  v  -p)     and  hence  is 
vacuous.     But   fumtj*    might  notTBe  vacuous,    so    I[p  |  (k- 
&  fumt'*)]     might  be~less  than    I[p  |  (k^  n  &  fumtg')]  . 
Proof  of  (c):    E^    might  be  a  change  re    E^_j.     I-interest  data 
of    E ^  ,   relative~to    E^,    are  values  of     IjTp  I  (Ki  n  &  fumt._  *)], 

where    p    is  L-implieU  by    (k.       &  fumt;  ,""_).    But~the  interest- 
—  —  — L,n  i"i,n' 

value  of    E0     is  the  news-value- of    Ec,    which   is    defined   in 

terms  of    Is[p  |  (ks  s  &  fumtg'')],    hence  in  terms  of   I^(p  |  kj,  n), 

where    p    is  L-implied  by    (k-       &  fumt-      ). 

Proof  of  (d) :   Since    m^    is  in~fumti  n,    rfie  fifth-order    Q- 

meaning  of    m-     is  the~set  consisting  of  all  L-true  wffs    and 

all  answers  to  questions  in    Q-  which   are    L-implied  by 

(k-  „  &  fumt-      ).     Since    m_     inconsistent  with    k„  ,    m       is 
— i,n  i»n  — — s  a.      — s. 

iniumtg  s  .         So  the  fifth-order  Q-meaning  of    ms     is    the 
set  consisting  of  all  L-true  wffs  and  all  answers  to  questions 
in    Qs    which  are  L-implied  by    (ks  &  ms).     But    Qs  =  Q^  n  , 
and    (ks  fk  IBs)     *s  L-equivalent  to     (k^       &  fumt^  n). 
Proof  of  (e):~Q-     **  =  Qg  s**.     Suppose"  QijU**  ~ii    empty. 
Then  the  cumulative  Q-adequacy  of  both    Ej,-.   .   .  ,  En   at   E_i 
and    Es     at    Es     is  zero.     Suppose     Q^  n**    is  not  empty.  Then 
the  data  for  the  cumulative  Q-adequacy" of    Ex,   .  .   .  ,  En    are 


102  communication:     a    logical    model 

defined  in  terms  of    p    and    DA,   where    DA    is  a  true  direct 
answer  combination  of    Q.    **    and    p    has  as  conjuncts  either 
L-truths  or  answers  to  questions  in    Q-  J*1*    which   are    L- 
implied  by  both    DA    and    (k  n  &  fumt  ^) .     But  the  data   for 
the  cumulative  Q-adequacy  01    Es     are    defined  in  terms  of  p 
and    DA  ,   where    DA    is  a  true  direct  answer  combination  of 
Os  s**     and    p    has  as  conjuncts  either  L-truths  or  answers 
to  questions  in    Qs  g**   which  are  L-implied  by  both   DA   and 
(kg    &  mg)  .  ~'~ 

Proof  of   (f) :     This  is  via  analogous  methods. 

sic      5fic      s5c      sQc      ste 

It  might  be  possible  to  use  the  concept  of  cumulative  Q-adequacy 
to  explicate  the  notion  of  communicative  rapport.       The   obvious 
course  is  simply  to  identify  the  degree  of  communicative  rapport 
with  the  cumulative  Q-adequacy,    the  point  being   that  the    cumu- 
lative Q-adequacy  represents  the  degree  to  which    S    has  satis- 
fied   R's  information  wants.     Perhaps  the  communicative  rapport 
can  be  so  identified  only  if  the    "face -value"    I    used  in  our  anal- 
ysis thus  far  is  replaced  by  a  double -probability    I''*    of  the    sort 
discussed  in  connection  with  Definition  14.3.        Investigation   of 
these  matters  is  beyond  the  scope  of  this  book. 

A  generalization  on  the  idea  that  communicative  rapport   can 
be  identified  with  cumulative  Q-adequacy  is  the  idea  that  the  rhe- 
torical skill  of    S    is  correlated  with  the  cumulative  Q-adequacy 
of  his  messages.     Regardless  of  whether  we  identify  rapport  and 
cumulative  Q-adequacy,    then,    we    might    (1)   try  to    distinguish 
formally  various  modal  performances  defined   in   terms    of    R's 
enlargement  of    Q    and    S's     cumulative  Q-adequacy  score,    and 
(2)    investigate  empirically  the    relations   between  these   modal 
performances  and    S's     success  as  a  rhetor. 

It  might  seem  that  we  are  ready  to  attempt  an  explication   of 
the  notion  of  authority,    in  the  sense  of  reliable  expertness;      it 
might  seem  that  we  could  begin  by  stipulating,    for  example,  that 
S    is  an  authority  for     R    in  a  controversy    E  ,   .   .  .  ,  En    only   if 
every    i    from    1    to    n    is  such  that    Ij^rri    &.  .  .&  m-    )  ^  I^(m. 
& .   .  .  &t  m^)  <  oo  .    But  there  are  good~reasons  for  postponing  such 
an  attempt;    there  are  indications  that  the  notion  of  authority,  in 
any  interesting  sense,    is  a  much  more  complex  notion   than   we 
can  handle  at  this  stage  of  our  inquiry.     We  do  not  yet  know,  for 
example,    how  well  an  authority    S    must  perform  in   answering 
R's     test  questions.     It  is  not  simply  a  matter  of  how  many  test 
questions     S    must  answer  in  order  to  qualify  as  an  expert;     the 
problem  is  rather  how  to  determine  what  are  the  test  questions 
which    R    may  reasonably  demand  that    S    answer.    We  shall  not 
attempt  to  grapple  with  these  problems  here. 

1  Concerning  how  complex  the  notion  of  authority  or  expertness  might 
be,  see  Kaplan  [1950]  and  Helmer  [1958],  and  recall  the  arguments  in 
Gallie  [1957]  and  Harrah  [1959]. 
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If  a  message  is  interpreted  as  bringing  a  description  of  some- 
thing,   the  message  will  seem  to    R    to  be  more  or  less  complete, 
and  the  degree  of  completeness  will  depend  on    R's     degree  of  in- 
terest in  the  subject  matter.     Accordingly,    we    may   regard   our 
definition  of  cumulative  Q-adequacy  as  explicating  the   notion   of 
adequacy  of  description.     A  partial  justification  for  this  interpre- 
tation is  that,    very  roughly,    if    S     is    completely   inadequate    in 
E.,   .  .  .  ,  En  ,    it  is  because    S    tells     R    nothing  that    R    wants  to 
know  at    E^- ,    and  if    S    is  completely  adequate  in    E  ,   .  .  .  ,  En  , 
it  is  because    S    tells    R    everything  that   R   wants  to  know  at  En. 

On  the  other  hand,   we  have  not  yet  analyzed  adequacy  of   ex- 
pression.     We  do  not  have  in  mind  here  the  technical  problem  of 
the  definability  of  concepts  within  formal  systems.2    Rather    we 
have  in  mind  the  problems  —  belonging  to  pragmatics  —  associ- 
ated with  declarations  like    'Words  cannot  express  what  I  feel;  I 
could  try  to  express  it,  but  anything  I  said   would  be    inadequate'. 
Declarations  of  this  sort  have  been  generalized  by  philosophers; 
the  view  of  Whitehead,    for  example,    is  that  no  verbal  formula- 
tion can  express  a  proposition  adequately.3       Concerning   these 
matters  we  offer  the  following  definitions:4 

Definition  16.13:    S    is  open  at    En    in  a  controversy   E  ,   .   .   . 
En  ,    with  respect  to     Q  ,    relative  to    q  ,    if  and  only  if 
~~(l)   q    is  a  question  in  the  sense  of  Definition  7.7  and   Q   is 
a  proper  question  set  in  the  sense  of  Definition  9.1;  and 
(2)    for   some     DA    of    Q    and  some    direct   or   partial   an- 
swer   p    to    q , 

(a)  (k^  &:  DA  &  q  &  umtn)     is  consistent, 

(b)  (k^  &  DA  &  q  &  umtn)    L-implies    p  ,    and 

(c)  no  direct  answer  combination   F   of   Q   is  such  that 

(1)  (k^  &  F  &  q  &  umtn)     is  consistent  and 

(2)  (k^  &  F  &  q)  L-implies    p  . 

Certain  obvious  applications  result  when  we  let    Q    be    Q      and 
q    be  a  question  which    R_    thinks  of  after     E  ,   .  .   .  ,  En    has-been 
concluded.     A  more  important  application  is    in   connection   with 
the  concept  of  inexhaustible  openness. 

Definition  16.14:    S    is  inexhaustibly  open  in  a    controversy 
E  ,  .  .  .  ,  En    if  and  only  if  there    exists    an  infinite    set    Q 
such  that 

(1)  every  member  of    Q    is  a  question  in  the  sense  of  Def- 
inition 7.7,    and 

(2)  for  every  finite  subset    Q'     of    Q    there  is  a  question 


2  See  Tar  ski  [1956],    Chaps.  X  and  XIII. 

3  Whitehead  [1929]. 

For  further  discussion  of  these  matters  and  their  philosophical  im- 
portance, see  Harrah  [I960], 


104  communication:     a    logical    model 

q    which  is  in    Q    but  not  in    Ct ,    such  that    S     is    open 
at    E       in    E  ,  .  .  .  ,  E.n  ,    with  respect  to  the  union   of 
Qn    and    Q  ,    relative  to    q  . 

If  S.  wants  to  show  that  his  messages  are  inexhaustibly  open 
in  a  controversy,  then,  according  to  the  above  definition,  he 
must  construct  an  endless  sequence  of  questions  each  of  which 
reveals  further  content  in  his  usable  messages.  The  following 
example  shows  that  it  might  be  quite  easy  to  construct  a  mes- 
sage which  is  completely  adequate  in  various  respects  but  is 
also  inexhaustibly  open. 

Consider  the  situation  in  which    R    asks    'Is  the  direction     a 
covered  by  the  radar?'    and    S     replies    'Yes,    all  the  directions 
are  covered'.     Let  us  formalize  this  in  the  following  way:      Let 
Fx    express  that    x    is  a  direction  and    Gx    express    that     x     is 
covered  by  the  radar.     Let    L    be  such  that    a,  ,    a    ,     ...     de- 
note directions,    as  follows:    a      and    a      denote   two   different  di- 
rections,   a      denotes  the  direction  midway  between  the  directions 
denoted  by    a      and    a    ,    a      denotes  the  direction  midway  between 
the  directions  denoted  by    a      and    a    ,    and  so  on.      Let  it  be    the 
case  that  for  every    a^    neither    Ga^    nor     -Ga^    is  L-implied  by 
any  consistent  conjunction  whose  conjuncts  are  (1)    Fa^    or     ^Fa^, 
or  (Z)    any  statements  of  the  form    Fa;  ,    -Fa:  ,   Ga: ,   or  -Gaj  , 
where     a;   ^   a^  .     In  other  words,    the  report  "that  any  particular 
direction  is  covered   is   logically  independent   of  reports    about 
other  directions.     Let    k    be    Fa    ,    and  let    Q    consist  just  of  Ga1?. 
Finally,    suppose  that    m    is     (x)(Fx  ->  Gx),    and  that  both     k     and 
m    are  true. 

In  this  situation,    which  is  a  controversy  consisting  of  one  com- 
munication event,     m    is  completely  adequate  in  the  sense  of  Def- 
inition 16.9.     But    m    is  also  inexhaustibly  open  in  the    sense    of 
Definition  16.14.     It  is  inexhaustibly  open  because,  for  example, 
there  exists  the  infinite  set  consisting  of  the  following  questions: 


(Fa3  &  Ga3)"   v  (Fa3  &  -Ga3)v 
t4  &  Qa4)*  v  (Fa^  &  -Gat)* 


-Ga  V 


In  the  presence  of  examples  like  this  one,    R     might  take   the 
attitude  that  Definition  16.14  is  an  adequate  definition  of  a  notion 
which  just  happens  to  be  trivial,    and  that    S     should  be  given   no 
credit  of  any  kind  for  inexhaustible  openness.     Alternatively,  he 
might  feel  that  while  Definition  16.14  is  adequate,    the   notion   of 
inexhaustible  openness  is  important  and    S     should  be  given  some 
credit  for  inexhaustible  openness  in  certain  cases,    for  example 
those  cases  in  which    S    does   not  know   in   advance    the    content 
of    Q. 
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Another  attitude  for    R    to  adopt  is  that  inexhaustibility  ought 
to  be  an  important  property  and  that  not  many  messages    should 
possess  it.     Adopting  this  attitude,    R    might  wish  to  use  not  Def- 
inition 16.14  but  a  stronger  definition,    one  which  is  like    Defini- 
tion   16.14  except  that    S    is  required  to  be  open  relative    to   the 
union  of    Q    ,    Cf  >    and    Q  * ,    where     Q*     is  defined  in  such  a  way 
that,    if  the- (kn  &  DA  &  q  &  umtn)     mentioned  in  clause  (2-b)    of 
Definition  16.1~3    L_-implies    certain   universal   generalizations 
tSi)-   •  .  (x     )Fx    .  .  .  x      ,    then    Of*     contains  corresponding  which 
questions       (Ex  )  .  .  .  (Exm)Fx    .   .  .  Xj^  .     Exactly  which  universal 
generalizations  are  relevant  here  [out   of  the   possibly   infinitely 
many  such],    more    generally  how  any   definition    stronger    than 
Definition    16.14   ought   to   be    formulated,    and   more   basically 
whether  the  notion  of  inexhaustibility  ought  to  be  nontrivial,    are 
three  of  the  many  problems  we  do  not  solve  in  this  book. 

This  book,  since  it  must  end  at  some  point,  might  appropri- 
ately end  here.  We  assume  that  the  reader  has  a  high  standard 
of  inexhaustibility,  and  we  do  not  claim  inexhaustibility  for  the 
book.  We  do  not  claim  adequacy.  The  most  we  dare  hope  is 
that,  in  regard  to  certain  aspects  of  communication  and  possi- 
ble methods  for  analyzing  them,  this  book  has  effected  in  the 
reader  a  growth  of  knowledge,  or  at  the  very  least  a  growth  of 
interest. 
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